The second law and fluctuations of work: 
The case against quantum fluctuation theorems. 
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We study how Thomson's formulation of the second law: no work is extracted from an equilib- 
rium ensemble by a cyclic process, emerges in the quantum situation through the averaging over 
fluctuations of work. The latter concept is carefully defined for an ensemble of quantum systems 
interacting with macroscopic sources of work. The approach is based on first splitting a mixed 
quantum ensemble into pure subensembles, which according to quantum mechanics are maximally 
complete and irreducible. The splitting is done by filtering the outcomes of a measurement process. 

A critical review is given of two other approaches to fluctuations of work proposed in the liter- 
ature. It is shown that in contrast to those ones, the present definition i) is consistent with the 
physical meaning of the concept of work as mechanical energy lost by the macroscopic sources, 
or, equivalently, as the average energy acquired by the ensemble; ii) applies to an arbitrary non- 
equilibrium state. There is no direct generalization of the classical work-fluctuation theorem to the 
proper quantum domain. This implies some non-classical scenarios for the emergence of the second 
law. 

PACS numbers: PACS: 05.30.-d, 05.70.Ln 



I. INTRODUCTION. 



The second law was deduced in XlX'th century, and 
formulated for a single closed system, in a way resem- 
bling the laws of mechanics 0, 0, 0, BJ p Jp • It was, how- 
ever, already the insight of Maxwell 0,11] and Gibbs 
that this law has in fact a statistical character, and refers 
to averages over an ensemble of identically prepared sys- 
tems, rather than to a single system. This viewpoint 
became widely accepted since the beginning of XX cen- 
tury, where first robust observations of fluctuations were 
made 1 . Together with theoretical works of Boltzmann 
in kinetic theory of gases and of Smoluchowski, Fokker, 
Planck and Einstein in physics of brownian motion, they 
formed a consistent picture of the second law as emerging 
from microphysics through averaging over fluctuations. 
A detailed summary of this activity is presented in the 
book by Epstein 0] , while Tolman discusses theoret- 
ical aspects of the situation. Since then, the statistical 
understanding of the second law entered several modern 
books of statistical physics and thermodynamics [E Q . 
The current perspectives on the classical and quantum 
brownian motion in the context of the second law can be 
found in Refs. EE El El 

At the end of 1970's several groups independently gave 
a derivation of Thomson's formulation of the second law 
EE EE EE EE EE EE EH : no work can be extracted from 
initially canonical equilibrium system by means of a cyclic 
thermally isolated process, starting directly from quan- 
tum or classical Hamiltonian equations of motion. The 



very possibility of getting such a straightforward ther- 
modynamical result directly from equations of motion is 
due to the fact that work is a transparent quantity un- 
ambiguously defined both in and out of equilibrium for 
any (quantum or classical) system interacting with ex- 
ternal macroscopic work sources 2 . As the main conse- 
quence, Thomson's formulation is the only one which is 
valid both for finite and infinite systems which do start 
in equilibrium, but can be driven arbitrary far from it by 
external sources (see [2(J and sections III CI and IVIII for 
more details). 

The standard understanding of the second law and 
fluctuations is based on Einstein's formula relating en- 
tropy to theprobability of a fluctuation around equilib- 
rium P, IE IE • This suffices for the purposes of near- 
equilibrium thermodynamics of macroscopic bodies, in 
particular, because all the formulations of the second law 
are equivalent for them and entropy is defined unambigu- 
ously. In the more general case of finite systems and/or 
systems driven strongly out of their initial equilibrium, 
relations between the second law and fluctuations ought 
to be studied anew for each meaningful formulation of the 
law in separate. The conclusions may differ from case to 
case and from formulation to formulation 

The purpose of the present paper is to understand how 
Thomson's formulation of the second law in the quantum 
situation emerges through the averaging over fluctuations 
of the equilibrium ensemble. More specifically, if the (av- 
erage) work done on the initially equilibrium ensemble 
during a cyclic process is always non-negative, what are 



It was thus rather surprising to see recent claims on "violations 2 These features of work are in contrast to those of entropy, whose 
of t he second law" |2.'J or "transient violations of the secon d la w" meaning is too closely tied to equilibrium states of macroscopic 

24| due to fluctuations; see in this context our comment l25l . bodies. 
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fluctuations of this work, and how do they behave? There 
are definite answers to these questions in the classical sit- 
uation: the definition of fluctuations of work is straight- 
forward, and a model-independent information on them 
is given by an equality first derived by Bochkov and Ku- 
zovlev in 1978 13] (BK equality). Later on, this equality, 
sometimes also called work-fluctuation theorem, was ex- 
tended to non-cyclic processes [21|, and has undergone 
various generalizations 3 . The basic messages of the clas- 
sical BK equality are recollected and reviewed in section 

UTeI 

While all these developments concern the classical sit- 
uation, a number of recent works is devoted to quantum 
extensions of BK equality [H |H E3 H El E2 . The 
first definition of fluctuations of work in the quantum 
situation and a quantum extension of BK equality was 
actually proposed by Bochkov and Kuzovlev themselves 
[l3l Il5| . It is based on constructing an operator in the 
Heisenberg representation, associating it with an opera- 
tor of work, and thus treating work as an ordinary quan- 
tum mechanical observable pertaining to the system and 
not to the work-source. 

Another extension was initiated by Kurchan plj . 
based on two-time measurements of energy. As implied 
by the standard treatment of quantum measurements, 
this second approach is closely tied to Schrodinger rep- 
resentation. 

There are therefore two different approaches to the def- 
inition of fluctuations of work and to quantum extensions 
of BK equ ality ' both of them attracted attention recently 
[II |M IS El El, and are reviewed below in sectionlVll 
The fact that in the quantum situation these approaches 
are different is calling for attention. The difference in 
viewpoints is not completely unexpected, since the work 
as it appears in statistical thermodynamics 0, 0, EL 01 
is an essentially classical quantity (mechanical energy 
transferred from a classical source of work). 

Our objective is to propose a third definition of fluc- 
tuations of work, which is motivated by the fundamen- 
tal physics of quantum (sub)ensembles. The definition 
is guided by the following observation. Since the usual 
work is now presented as an average of a random quantity 
— for the moment we leave unspecified whether this is a 
random classical quantity or an operator — it is natural 
to require the following two conditions on its (fluctuat- 
ing) realizations and on its average: 

• Once the average work is unambiguously defined for 
any quantum system starting in an arbitrary initial 
state and interacting with a macroscopic source of 
work, the same should hold for fluctuations of work. 



3 A rather complete account of various generalizations of the clas- 
sical work-fluctuation theorem, as well as its relation with other 
fluctuation theor ems, e. g., those describing entropy production, 
is given in Refs. Bid 12(1 . Local versions of the fluctuation theo- 
rems arc also discussed there. 



In particular, one cannot restrict the general defi- 
nition to (initially) equilibrium states of the sys- 
tem. This is relevant, since one of the chapters in 
statistical thermodynamics deals with work extrac- 
tion from non-equilibrium systems 3, 40], and one 
should, of course, be able to define fluctuations of 
work in this most general situation. 

• Realizations of the random quantity work should 
have the same physical meaning of mechanical 
(high-graded) energy as the usual (average) work. 
In particular, if one happens to extract some work 
from a single realization, one should be able — at 
least in principle — to use precisely this amount 
for the standard purposes, e.g. for driving motors. 
(Basic features of work are recalled below in section 

EH) 

Both these conditions are satisfied by the classical def- 
inition, and to our opinion without them the very pro- 
gram of studying the emergence of the second law in the 
quantum situation becomes ill defined. 

It appears to the present authors that, as we discuss 
below in section IVT1 neither of the existing two quantum 
approaches — in the way they stand presently — can be 
viewed as providing a proper definition of fluctuations 
of work in the quantum situation. Both approaches fail 
out of equilibrium (no first condition), while even for an 
initially equilibrium state it is not clear that the second 
condition is satisfied. Neither these points were discussed 
in pap ers which support those definitions; see e.g., 0, 

HMmnaililEl. 

These are the reasons to introduce in the present paper 
a third approach to quantum fluctuations of work, which 
will satisfy the above two conditions. Our approach 
starts with explicitly respecting the first condition, that 
is, always defining realizations of (the random quantity) 
work as an average energy given off by the macroscopic 
source of work. If the corresponding ensemble of physi- 
cal systems already consists of subensembles, non-trivial 
realizations can be defined via the average energy ex- 
change of each subensemble with the source. (As with 
any exchange process, this is operationally characterized 
by measurements at two different times.) For a classical 
ensemble each single member completely characterizes a 
subensemble, and the classical definition of fluctuations 
of work follows naturally. In contrast, a quantum equilib- 
rium state is described by a homogeneous quantum en- 
semble, the gibbsian, which by itself does not consist of 
subensembles. This prevents us to proceed as such. First, 
an inhomogeneous quantum ensemble has to be prepared 
from this homogeneous one, a task accomplished via a se- 
lective quantum measurement. The obtained structure of 
subensembles does depend on the type of measurement, 
and as a consequence the resulting fluctuations of work 
in the quantum situation appear to be context-dependent 
(contextual) . Still some relevant characteristics of these 
fluctuations can be context-independent, as seen below. 



3 



As the main result, the second law in Thomson's for- 
mulation — whose statement reads in the same way both 
in quantum and classics — has in those two situations 
rather different scenario of emergence. The basic quali- 
tative difference is that in contrast to classics, the fluctu- 
ations of work in the proper quantum situation are not 
controlled by any direct analog of BK equality. More 
specifically, in classics the structure of work as a random 
quantity is such that there have to be realizations which 
provide work (i.e., which are active). In the quantum 
case, however, there need not be any active realization. 

In this paper we have taken the simplest approach 
which allows to study Thomson's formulation of the sec- 
ond law and fluctuations of work, that is, we consider a 
finite quantum or classical system interacting with exter- 
nal sources of work. The restriction to finite — though 
possibly large — systems is at any rate natural for study- 
ing fluctuations. The explicit presence of thermal baths, 
as well as details about the thermodynamical limit for 
the system are left for future studies. 

The paper is intended to be self-consistent and is orga- 
nized as follows. In section [H]wc recall the definition of 
fluctuations of work in the classical situation and review 
the BK equality and its consequences. In section UTTl we 
present the definition of fluctuations of work in the quan- 
tum situation. The dispersion of work is studied in sec- 
tion ]W\ In section we show that fluctuations of work 
in the quantum situation are not controlled by any di- 
rect analog of the classical BK equality. An anti-classical 
scenario for the emergence of the second law in Thom- 
son's formulation is described in section lTVl In section lVTl 
we make comparison with the two known approaches on 
fluctuations of work in the quantum situation. These ap- 
proaches offer different extensions of classical BK equal- 
ity. We do not intend to imply that these approaches do 
not have a physical meaning or that they cannot be use- 
ful for their own sake. We only state that — in the way 
they stand presently — they do not describe fluctuations 
of work in the proper quantum situation. We close with 
a discussion. Some details are worked out in appendices. 



II. CLASSICAL FLUCTUATIONS OF WORK 
AND BK EQUALITY. 

A. The setup. 

Consider an ensemble £ of identical classical systems 
S which are thermally isolated 0, Q: they move ac- 
cording to their own dynamics and interact with an ex- 
ternal macroscopic work source W. This interaction 
is described via time-dependence of some parameters 
R(t) = {Ri(t), i?2(i), ••■} of the system's Hamiltonian 
H(t) = H{R(t)}; seeRefs. Hi. 

The parameters move along a certain trajectory R(t) 
which at some initial time t = starts from i?(0), and 
ends at R(t) at the final time t — r. Cyclic thermally 



isolated processes are defined by R(0) = R(t) and thus 

H{R(r)} = H{R(0)} ee H. (1) 

At the initial time the ensemble is in equilibrium, that 
is, the common probability distribution V(x,p;t = 0) = 
V{x,p) of all its canonically conjugated coordinates x = 
(x%, ...,x n ) and momenta p = (pi, ...,p„) is given by the 
Gibbs distribution with the initial Hamiltonian H(x,p) 
and temperature T = 1//3 > 0: 



V(x,p) 



-0H(x,p) 



z 



dx dp e 



(2) 



This equilibrium distribution can be prepared by means 
of a thermal bath coupled with the system S for t < 0. 
It is assumed that for times < t < r the system S 
is decoupled from the bath — an alternative assumption 
would be that its coupling to the bath is so weak that 
it can be neglected in the considered time-interval — and 
the evolution of the ensemble is described by the Liouville 
equation for V(x,p;t): 



d{P{x,p]t) 



dH(x,p,t) dV(x,p;t) 

dp dx 
dH(x,p,t) dV(x,p;t) 



dx 



B. Work 



dp 



(3) 



In statistical thermodynamics there are two alternative 
definitions of work 0, 0, [J IE H3 • Both are necessary for 
the proper understanding of its physical meaning 0, [23, 
E2- The first reads 

• The work W is the average energy gained by iS 
during a thermally isolated system-work-source in- 
teraction with >V Hl|: 



W= J dxdp[V(x,p; t)H(x,p;t) 

-V(x,p)H(x,p;0)}. (4) 

Due to conservation of energy, W is equal to the av- 
erage energy lost by the work-source W. This definition 
was (implicitly) proposed by Caratheodory (33|. A con- 
cise history of various definitions of work is given in |38| , 
while various perspectives of work in classical mechanics 
are reviewed in [39| . 

For cyclic processes Eq. (0} takes a simpler form 



W = J dxdp[P(x,p; t) ~V(x,p)]H(x,p; 0). 



(5) 



There is a second, alternative definition going back to 
Gibbs and Planck 
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• The minus work — W is the energy transferred to 
the work-source W. Its distinguishing feature with 
respect to other forms of energy is that it can, in 
principle, be transferred with 100 % efficiency to 
other work sources via interactions of the system- 
work-source type. In particular, it can be retrans- 
ferred to collective degrees of freedom that perform 
classical deterministic motion generated by a suit- 
able Hamiltonian. These degrees of freedom are 
thus purely mechanical and serve as prototypes of 
macroscopic mechanical devices (such as a motor, 
piston, turbine, etc.). For them the differential 
work can be calculated in the usual way of ordi- 
nary mechanics, that is, multiplying the external 
force by the corresponding displacement Q . 

Both these definitions of work are expected to be equiv- 
alent at least for sufficiently ideal work sources [3. 133 , 14^ . 

Yet we mention for completeness another, equivalent 
formula for the work W ; the integral of the rate of energy 
change: 



IT = J dt I dxdpV(x,p;t) 



dH(x,p; t) 
dt ' 



(6) 



To get from here to Eq. JSJ one performs integration 
by parts, uses the standard boundary conditions, that 
is V{x,p; t) decays for x — ► ±oo or p — ► ±oo and employs 
Eq. ©. This formula for W is more general and can be 
applied to processes that involve explicit thermal baths. 



where H(x(t),p(t);t) is the value of the Hamiltonian 
on the trajectory that started at t = from (x,p), with 
x{t) and p(r) being the corresponding solutions of J7J. 
This work can be observed as the energy decrease of the 
mechanical degree of freedom of the macroscopic work- 
source, or alternatively via energy increase of the system 
S. In this latter scenario the energy of S has to be mea- 
sured two times, at the moments t — and t = r. 

The so defined work w(x,p) for a single system is a 
random quantity, since it varies from one single system 
to another. It can be positive or negative. Its probability 
distribution P(w) is determined by V(x,p), since this is 
the probability by which each single system enters in the 
ensemble: 



P(w) — / dx dpV(x,p)S(w — w(x,p)). 



(10) 



There being used no special features of the initial equi- 
librium distribution function, the same definition for the 
work in a single realization can be given for any initial 
ensemble. 

It is seen that the two desired conditions for fluctua- 
tions of work formulated in section [I] are naturally satis- 
fied: the initial distribution may be arbitrary and "work 
for a single realization" has the same physical meaning 
as average work. 



D. Derivation of BK equality. 



C. Fluctuations of work 

Though the ensemble £ is described by the probability 
distribution V(x,p), each single system S from this en- 
semble has at a given moment of time explicit values for 
all its dynamical variables. These values may vary from 
one single system to another due to the distribution of 
initial conditions. 

Each single member of the ensemble is then coupled to 
the external source of work that realizes on it a unique 
thermally isolated process (the same for all members) . In 
other words, the same parameters R(t) of the Hamilto- 
nian are varied in the same way for each member. The 
motion of the single system is described by Eq. Q with 
now V(x,p;t) being a product of two delta-functions 
8{x — x(t))5(p — p(t)), which are probability densities 
concentrated at the solutions of the canonical equations 
of motion: 



p = —d x H(x,p;t), x = d p H(x,p;t). 



(7) 



The trajectories generated by J7J), together with their 
initial conditions distributed according to Eq. @, serve 
as realizations of the random process given by Eq. J3J). 

The work w(x,p) exchanged in each thermally isolated 
process can then be calculated consistently with Eq. JSJ : 

w(x, P )=H(x(t),p(t);t)-H(x,p) (8) 
= H(x(t),p(t))-H(x, P ), (9) 



One now derives B K eq uality in the classical situation 
for a closed cycle, EUllEU: 



{e~? w ) = / dwP(w)e- l3w (11) 
dxdpV(x, P ; Q) e -P w( - X ^ 



- 1 [ A T A np -i3H(x, P )-Pw(x,p) 

z(o)J axdpe 

Z(0) J 

= W) I dx W d p( T ) e ~ mx(T) ' piT) ' T) 

Z(r) 

Z(Q) ' 



(12) 



where we used Liouville' theorem da; dp = da;(r)dp(r) 
and Eqs. 10 El EH- The last equality in JEJ is due to 
the assumed cyclic feature of the process. 



E. Qualitative messages of the BK equality. 

The BK equality is by itself an exact mathematical 
relation. Several important qualitative results can be de- 
duced from it: 
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a. The second law. As the exponential function is 
convex, one gets directly 1 = {e~^ w ) > e~' 3 ^, and then 
W = (w) > 0, which is the statement of the second 
law in Thomson's formulation: no work can be extracted 
from an equilibrium system by means of a cyclic process. 
This formulation of the second law is well-known and has 
an independent and more general derivation both in the 
classical and the quantum situation 0, Il6l IrH llsl [Tgt 

M 

b. Active realizations. To satisfy 1 = (e *"") di- 
rectly leads to the following observation: for any cyclic 
thermally isolated process there are realizations which 
are active, that is, for which work is extracted after the 
process: w(x,p) < 0. The relative weight of such active 
realizations can be estimated via the Cauchy inequality: 



F. Non-cyclic processes. 

For non-cyclic processes there is an analog of equality 
21 , which is derived in similar way with the conclusion 



( e-' Jw ) 



-/3(F(r)-F(0)) where F 



-T\nZ is 



the corresponding free energy. This relation allows to 
calculate differences of free energy via (non-equilibrium) 
measurements of work. This generalized equality is not 
directly relevant for our present purposes, because here 
we are interested by the second law in Thomson's formu- 
lation which refers to cyclic processes. 



III. QUANTUM ENSEMBLES AND THE 
DEFINITION OF FLUCTUATIONS OF WORK 



da; dp y/V{x,p) ^/V(x 1 p)e- I3w{x - P) 



< 



d X d P V( X , P )jd X dpV( X , P )e 



■2f3w(x,p) 



which can be written as 

(e~ 2f}w ) > 1. 



(13) 



(14) 



A stronger relation can be obtained with help of a gener- 
alization of the Cauchy inequality, described in Appendix 
|X| It reads: 

/ p -2/3w\ > -, , [(I/ - (/)] e l3W )] , /, 

(e >>1+ ([/ _ (/)]2) >1- (16) 

where f(x,p) is an arbitrary integrable function in the 
phase-space, and where 



(/) = dxdpV(x,p) f(x,p). 



(16) 



Eq. i|15|) is stronger than (|14f) . since now {e~ 2/3w ) is shown 
to be strictly larger than 1. Inequalities (|14l I15|l allow 
to understand how relevant the active realizations are 
with respect to both their probability and the amount of 
extracted work. 

c. Dispersion of work. For sufficiently high temper- 
atures one can make a cumulant expansion: 

1 = exp[~(3(w) + ^((w 2 ) - (w) 2 ) + ...} (17) 

which shows that for sufficiently high temperatures the 
ratio of the dispersion of work (w 2 ) — (w) 2 and its average 
increases with temperature: 



= 2T. 



(18) 



A detailed survey of various cumulant expansion- 
based results derivable from BK equality is contained in 
Refs. Ulllllli. 



A. The setup. 

The quantum setup for studying thermally isolated 
processes is a straightforward extension of the classical 
one. (We denote all operators by a hat.) 

An ensemble E of identically prepared quantum sys- 
tems S is described at t — by a density matrix p(0) = p. 
The eigenresolutions of p and of the Hamiltonian H read: 



^Pk\Pk){pk 



fc=i 



h = y^gfcigfc)(gfci, 



(19) 
(20) 



fe=i 



where {\e k )} r k L =l and {\pk)}\ = i with (£fc|ej) = (pk\pi) = 
8ki are the eigenvectors of H and p, respectively, which 
form bases in the n-dimensional Hilbert space 7i, and 
where Ek and pk are the corresponding eigenvalues. 

Frequently, but not always, we will consider initially 
Gibbsian states: 



p(0) = p 



-0H 



z 



Z = tr e 



-3H 



(21) 



-Pe k 



Pk 



En 
fc=l 



e -0e k 



\pk) = \£k), k=l,..,n, (22) 



where T=l//3>0is the temperature of the ensemble. 
We shall order the eigenvalues of H as 



ei < e 2 < ■■• < £r, 



(23) 



Then according to (|22|) . the eigenvalues of p will be or- 
dered as 



Pi > P2 > ■■■ > Pn > 0. 



(24) 



For the Gibbsian density matrix all eigenvalues are pos- 
itive. 

Analogously to the classical case, the Gibbsian state 
(|21|l is prepared for t < by letting S to interact with a 
macroscopic thermal bath, and then decoupling it from 
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the bath, so that the interaction is absent for t > 0. 
There is, however, a relevant difference between quantum 
and classical: in the quantum situation the coupling of S 
with the bath has to be weak for the stationary state of S 
to be Gibbsian 4 . A detailed analysis of this and similar 
differences between the emergence of Gibbs distribution 
in quantum and classical situations is presented in [TH 

At t = S starts to interact with an external macro- 
scopic work source W. The resulting evolution of S is 
generated by (an effective) Hamiltonian H{R(t)}, which 
is time-dependent via classical (c-number) parameters 
R(t). The evolution of S is thus unitary and has the 
same general features of reversibility as the dynamics of 
a completely isolated S. It is well known that in general 
a Hamiltonian evolution of the complete system S + W 
does not reduce to a Hamiltonian evolution for the state 
of S. However, in the present case this is achieved owing 
to the macroscopic character of W, as discussed in 

A cyclic process at the moment t — t is defined in the 
same way as in classics, that is, via R(t) = R(0), leading 
to 

H{t) = H{Q) = H. (25) 
The Hamiltonian H (t) generates a unitary evolution: 



ih-p(t) = [H(t),p(t)}, 



p{t) = U t p(0)Ul 



U t = exp 



^ Jo 



dsH{s) 



(26) 
(27) 
(28) 



where exp and exp denote time-ordered and time-anti- 
ordered exponents, respectively. 



Here U$ H(t) U t is the Hamiltonian operator in the 
Heisenberg representation at the end-time r of the cyclic 
process. The operator is sometimes called 'operator of 
work' [6I IT3LI27I . We shall show, however, in section llV Al 
that it is not clear whether it fulfill all criteria to deserve 
this identification. Moreover, the much weaker interpre- 
tation of Cl — by analogy to the classical expression JSJ 
as "energy difference operator in the Heisenberg represen- 
tation" is also incorrect in general; see section llV Al In 
our approach Cl will always appear inside averages over 
density matrices, so we do need any more particular in- 
terpretation of f2; it will only enter the definition of work 

The remarks we made after Eq. © for the classical sit- 
uation are valid in the quantum case as well. W is equal 
to the average energy decrease of the work source W. 
This is a classical, mechanical energy which can trans- 
ferred with 100% efficiency to other work-source, and, 
in particular, it can transferred to another mechanical 
degree of freedom performing classical deterministic mo- 
tion. In that respect both the classical and quantum def- 
initions are consistent and can be indistinguishable from 
the viewpoint of this mechanical degree. This property 
is the underlying reason why phenomenological thermo- 
dynamics, where not any (quantum or classical) identifi- 
cation of S is given, can exist. 

The work is typically observed via suitable (classical) 
measurements done on the work source, or, alternatively, 
by measuring the initial and final average energies on the 
ensemble £. Both these ways are routinely employed in 
practice, e.g., in NMR/ESR physics, where the system S 
corresponds to spin-i under influence of external mag- 
netic fields |50| . 

Finally, the quantum analog of formula © reads: W — 

and Eq. I|29(l can be recovered from 



dH(t) 



dt 



J T dttr[p(t) 

this formula upon integration by parts and using H26[) . 



B. Work. 

The whole discussion in section Hi Bl directly applies in 
the quantum situation, except that S is now a quantum 
system, and Eqs. (@] |3J) should be substituted by their 
quantum analogs (i.e., V — > p, H — > H and j dxdp — > 
tr). More specifically, the work W done by the external 
source W is identified with the average energy gained by 
the ensemble 0,0 

W = tr[ ( o(r) H - p H] = tr p O, (29) 
where we denoted 

Cl = U} H{t) U T - H = UlHU T - H. (30) 



C. Quantum ensembles. 

The definition of fluctuations of work in the classical 
situation was based on the distinction between classical 
ensemble of systems described by a probability distribu- 
tion versus a single member of that ensemble. It should 
not be surprising that fluctuations of work in the quan- 
tum situation are closely tied to the meaning of what is 
a quantum ensemble. 

Thus, for our further purposes we need an account of 
various features of quantum ensembles and their differ- 
ences with respect to the classical ones. There are several 
sources in literature O EE HE EE 03 where this 
type of questions is studied with special attention °. 



4 Due to weak coupling to the bath, the energy costs for switching 
the interaction on and off become negligible. This holds both in 
the quantum and the classical situation. 



J Though the theory of quantum ensembles is almost as old as 
quan tum mecha nics itself, it still attracts lively discussions; see 
e.g. 64, 65, 66 1 . It is interesting to note the basic differences 
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1. Statistical interpretation of quantum mechanics. 

Within the standard quantum mechanics a quantum 
'state' is described by a density matrix p. Any state, 
including a pure state IV'KV'Ii describes an ensemble of 
identically prepared systems For instance, in an ideal 
Stern-Gerlach experiment all particles of the upper beam 
together are described by the wavefunction | |) or the 
pure density matix | t)(T I- The description is optimal, in 
the sense that all particles have o~ z = +1, but incomplete 
in the sense that their a x and o~ v are unknown: upon 
measuring either of them, one will get ±1 with equal 
probabilities. 



2. Homogeneous ensembles. 

In general, a density matrix p can be applied to de- 
scribe two types of quantum ensembles, homogeneous and 
inhomogeneous. 

For a homogeneous ensemble £ (p) only the density ma- 
trix p is given and no further specification is made about 
a single system S from that ensemble. A typical exam- 
ple is an ensemble prepared by thermalization, that is, by 
letting each single system S to interact weakly with an 
equilibrium thermal bath, and waiting sufficiently long 
till the equilibrium state of S is established. 

Let us study the defining feature of homogeneous en- 
sembles in more details. We start by comparing them to 
classical ensembles. In the classical situation, the descrip- 
tion of an ensemble by means of a probability distribution 
still implies that each single system has definite values for 
all its variables. For a homogeneous quantum ensemble 
£ (p), only those observables (hermitian operators living 
in the Hilbert space H) A that are dispersionless on £(p), 



tr (A pj = tr (A 2 p 



(31) 

can be said to have definite values for all single systems 
S from £{p) . Indeed, it is shown in Appendix ICl that 
dispersionless observables satisfy 

Ap = ap, (32) 

where a is a c-number. This implies 

~ r * i m 

tr(A m p) = trAp , m = 0, 1, 2, 3..., (33) 

and the above statement follows. For a pure state p = 

we return from (|32|l to the standard notion of 
being an eigenstate of A. 

Any other, non-dispersionless observable B — even if 
it commutes with the density matrix p — does not have 



a definite value in a single system S from £ (p). It is 
true that for [p, B] = 0, £{p) can be prepared by mix- 
ing 6 pure states ensembles {£ (\pk)(Pk\) }k=i with prob- 
abilities {p fc }fc =1 , where { \p k ) }£ =1 and {p k }k=i are, re- 
spectively, the common eigenvectors of p and B and the 
eigenvalues of p. If £{p) is known to be prepared in such 
a way, then B has indeed definite values for each sin- 
gle member of £. However, in general this need not ap- 
ply, since there are (infinitely) many other ways to pre- 
pare the same ensemble £{p) via mixing N subensemblcs 
with density matrices {\tp a )(ipa\}a=i and probabilities 
{\ a }a=i- They correspond to the (infinitely) many ways 
in which the hermitian operator p can be decomposed as 



N 

P=^2 K\lpa)(lpc 



I, K > o, 



N 



X n — 1, 



(34) 



a=l 



where \ip a ) are some normalized — but in general not 
orthogonal — vectors living in the same n-dimensional 
Hilbert space Tt 7 , and where \4'a}(' t l'a\ are distinct. 

The eigenresolution 119|) is only a particular case of 
(|34|l . and if now the ensemble £ (p) was prepared by one 
of the ways corresponding to l|34l) with non-orthogonal 
\4> a ), the constituents of £{p) come from the subensem- 
bles £{\4>a){ipu\)} and the observable B has in general no 
any definite value for these subensembles. 

The above discussion allows to conclude with two re- 
lated features of a homogeneous ensemble: i) a single 
member of such an ensemble does not by itself define a 
subensemble; ii) the ensemble cannot be thought to con- 
sist of definite subensembles. 



3. Pure-state ensembles. 

The description of a homogeneous ensemble via pure 
density matrices, p 2 = p, has several special features. 

First of all, it is seen from l|34|l that for a pure state p = 
\ip) (-01 in the RHS of representation (|34|l only one term 
shows up: \tp){ip\ = (tp\ 8 - Thus, pure-state ensembles 
cannot be prepared via mixing of other ensembles of the 



between classical and quantum ense mble s were correctly under- 
stood by Elsasser as early as in 1937 l43l . 



6 Mixing ensembles £(pi) and £{f>2) with probabilities pi and p2, 
respectively, means that one throws a dice with probabilities of 
outcomes equal to p\ and p2, and depending on the outcome one 
picks up a system from £(pi) or £(p2), keeping no information 
on where the system came from. Alternatively, one can join 
together Npi systems from £(pi) and Np2 systems from £{f>2) 
(N 3> 1), so that no information information is kept on where a 
single system came from. Then any subensemble of M systems 
(N S> M) is described by the density matrix p = pi pi + P 2 P2- 
Note that the restriction N 2> M is important, see, e.g., 1651 . and 
some confusion arose in literature for not taking it into account. 

7 Normalization and belonging to H. are necessary for |V'q){V'q I to 
describe some ensemble of the systems S. 

8 This can also be deduced from a more general result: for any 
\ip a ) that can appear in 1341 . either p = \^ a )(ipa\, or \ip a ) is 
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system <S, or, put differently, pure-state ensembles are 
irreducible. 

Second, this description is the maximally complete one 
possible in quantum mechanics. This known thesis can 
be substantiated as follows. First one notes from (|31l 
I32[l that for a fixed p dispersionless observables form a 
linear space: if two operators are dispersionless, so is 
their sum, and multiplication by a number conserves the 
dispersionless feature. 

From 1)32(1 and Appendix Icl one sees that if the mixed 
density matrix p has k, 1 < k < n, non-zero eigenvalues 
(n being the dimension of the Hilbert space H), then the 
dimension of the linear space formed by the correspond- 
ing dispersionless observables is equal to 

N k = (n - kf + 1. (35) 

This number is maximal for k = 1, that is, for pure 
density matrices. In other words, pure density matrices 
provide definite values for a larger set of observables than 
mixed density matrices 9 . For a mixed state all disper- 
sionless observables have to be degenerate. 

Though the above two features of irreducibility and 
completeness create a conceptual difference between pure 
and mixed density matrices, this should certainly not 
be taken as an invitation to prescribe pure density 
matrices to a single system, reserving the mixed ones 
for ensembles; further reasons for this are analyzed in 

Refs. IHE1E1E1E1E1 io . 

4- Inhomogeneous ensembles. 

A mixed density matrix p can also describe inhomoge- 
neous ensembles. Such an ensemble E\ is a collection of 
homogeneous subensembles {£{p a ) }a=i with probabili- 
ties { A a }„ =1 , so that each single system from E\ is known 
to be taken from the ensemble £{p a ) with probability A a , 
a = 1,..,N. Obvious cases are when the subensembles 
£{p a ) are separated in space or in time, or by means of 
some other classical quantity. 

Inhomogeneous ensembles are typically prepared by 
means of selective measurements 11 . In that case the 



orthogonal to the linear space formed by the eigenvectors of p 
corresponding to eigenvalue zero. Indeed, let |0) be one such 
eigenvector, then (0|p|0) = £ Q \ a |(0|V> Q }| 2 = 0; thus |<0|Va) = 
for \ a > 0. 

For k = n we get 2Vfc = b since in this case only operators 
proportional to unity are dispersionless. For n = 2 and k = 1, 
N/, = 2: all dispersionless observables for a two-dimensional pure 
density matrix can be represented as ce\ip)(tp\+fi\rp±)(ipj_\, 

where (ip\ip±) = 0, and where a and /3 are two independent real 
numbers. 

Among reasons we find convincing is the analysis of the quantum 
measurement process |4ll . 

These measurements need not be done on the systems S di- 
rectly, they can be indirect as well. Imagine an ensemble of two 
spin-i particles described by pure density matrix \ip)(^)\, where 



above classical quantity is the corresponding record of 
the macroscopic apparatus by which this measurement 
was done. Below in section MI El we describe in detail 
how an initially homogeneous ensemble can be separated 
into subensembles by means of a measurement. 

The inhomogeneous ensemble £\ is still described by 
the overall density matrix p — Yl a =i ^aPa, but in con- 
trast to the homogeneous situation this is not the full 
description. The latter is provided by the list 

{Aa.MSU- ( 36 ) 

So more information is known about the inhomogeneous 
ensemble S\ then only p. If the inhomogeneous ensem- 
ble is just a combination of homogeneous ones, this is 
obvious. If the inhomogeneous ensemble was prepared 
by means of a measurement, then the above information 
results from the measurement carried out and from se- 
lection of the outcomes (see more details in section UlI El 
below) . 



5. Prescribed ensemble fallacy. 

This fallacy rests on forgetting the difference between 
homogeneous and inhomogeneous ensembles |46l l6l| . In 
spite of explicit warnings [3|, the fallacy frequently ap- 
pears in applications and interpretations of quantum sta- 
tistical physics. Consider, for example, the basic tool 
of statistical physics, the equilibrium ensemble described 
by the Gibbsian density matrix l|21l) . It is typically ob- 
tained by thermalization process, that is, due to inter- 
action with a thermal bath. One sometimes hears with 
respect to this ensemble that it represents the system 
being in states of definite energy with the corresponding 
probabilities This is a valid description of the en- 
semble only after the measurement of energy H has been 
done, something which is by itself not typical in applica- 
tions. Moreover, as we recalled above and below, one can 
choose to make a different measurement, and then the in- 
terpretation in terms of definite energies will be explicitly 
wrong. The reason of why some applications — though 
starting from the above incorrect premise — do not lead 
to contradictions is clear: they use this premise merely 
for "explanations of what actually happens" , while in real 
calculations and comparisons with experiment only the 
density matrix (|21|) is employed. 



|V>} = 7f (l+>i ® l+>2 + |-)l ® |->2), and where |±)i, 2 are the 
(1 2) 

eigenvectors of a z with eigenvalues ±1 for the first and second 
particle, respectively. One can now measure 8« , and keep both 
the results of these measurements and the order of their appear- 
ance (thus, one keeps a sequence of random numbers ±1). For 
the subensemble of the second spin this amounts to preparation 
of inhomogeneous ensemble {i, | +} 2 2 {+ 1 ; ^,|— }22(— |}- 
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D. Fluctuations of work. 

Once the properties of quantum ensembles are clari- 
fied, we can proceed with the quantum definition of fluc- 
tuations of work. The most reasonable way to define this 
concept in the quantum situation is to proceed along the 
same lines as in classics, taking into account when needed 
the differences between quantum and classical ensembles. 

It is convenient to separate the definition into the fol- 
lowing steps. 

1. The initial ensemble £{p) is homogeneous, since 
it was prepared by means of a thermal bath. With 
help of a suitable measurement (see section IIII El for 
details), one separates £{p) into irreducible, maximally 
complete subensembles { £(\ip a )(ip a \) }^=i with proba- 
bilities {\ a }a=i, so that the resulting inhomogeneous 
ensemble is still described by the same density matrix p 
and thus l|34|l is valid. 

In the quantum situation irreducible, maximally com- 
plete subensembles are described by pure density matri- 
ces |^>)(^>|, as we recalled above. The important point is 
that these subensembles play here the same role as the 
single systems for the classical definition of fluctuations 
of work. 

Note that once it is understood that the initial en- 
semble £{p) is homogeneous and that measurements are 
anyhow needed to make it inhomogeneous, we have to 
admit any measurement which will produce pure-state 
ensembles, even those with non-orthogonal I^q^'s. 

Recall that the present step of preparing an inhomo- 
geneous ensemble out of the initial homogeneous one is 
absent in the classical situation, simply because there are 
no essentially inhomogeneous classical ensembles. 

2. This step almost literally repeats its classical 
analog. The single systems from each subensemble 
£(\ip a ) (ip a \) interacts with the work source which real- 
izes the same thermally isolated process on each single 
system from each subensemble. 

The evolution of the corresponding subensemble dur- 
ing the cyclic process between times and r is given by 
the von Neumann equation 

ih—p a (t) = [H(t),p a {t)], p a (0) = \ip a )(ip a \ (37) 

p a (r) = U T p a (0)U].. (38) 

3. In analogy with the corresponding classical step 
we define the work w a done on the subensemble a via 
Eq. ©: 

w a = tr (^Cl \ip a )(ip a \ ) 

= (Mr)\H\Mr)) ~ (M0)\H\M0))- (39) 

This is the average energy decrease of the mechanical 
degree of freedom of the work source due its interaction 
with the corresponding subensemble. Thus w a has the 
meaning of work by itself, but it is a quantity that had 
to be averaged over the subensemble. The probability 



of w a is equal to A Q , since, as seen from (|34[1 . this is the 
probability by which the corresponding pure subensemble 
enters the overall ensemble described by p. 

Thus we defined a random c-number quantity work w 
with realizations w a and probabilities X a : 

w = {w a , A a }^ =1 . (40) 

As follows from (|29l I34J) the work done on the overall 
ensemble is equal to the weighted average over the pure 
subensembles: 

N 

W = Y,^w a . (41) 

a=l 

Eq. 1)41(1 remains true for any initial ensemble. It is 
straightforward to see that our definition of fluctuations 
of work can be applied to any initial ensemble and not 
only to that described by the Gibbsian density matrix 

mi- 

The thus defined fluctuations of work do depend on the 
pure ensembles {|V'a)(V'a|}^Li, defined uniquely once the 
measurement separating the overall ensemble into pure 
subsensembles is specified. Strictly speaking, what we de- 
fined as fluctuations are the ones between subensembles 
(inter-subensemble fluctuations). Within the standard 
quantum theory we do not know how to define fluctua- 
tions of work inside of a irreducible subensemble. There 
were in literature some attempts in this direction, which 
are described in section IVll However, they do not satisfy 
the natural conditions on fluctuations of work, as out- 
lined in the Introduction (arbitrary initial state; proper 
physical meaning). In particular, the approach based on 
the "operator of work" is not applicable, since we will 
explain that this operator does not satisfy the proper 
criteria 12 . 

If there is no interaction with any work source, that is, 
the Hamiltonian H is time- independent, and if in addi- 
tion [p, H] = 0, then the ensemble described by p is sta- 
tionary: all (one-time) averages are time-independent. 
Now note that the stationary ensemble can be decom- 
posed into non-stationary subensembles, since in general 
[\ip a )(ip a \ , H] 7^ 0. This is clearly impossible for a clas- 
sical ensemble, but in the context of fluctuations of work 
this fact implies nothing pathologic, since work is defined 
for any initial ensemble, not only for stationary ones. It is 
checked from (|39|l that if there is no interaction with the 
work-sources, then = 0, and all possible realizations of 
work are zero. 

4- Note that for macroscopic systems it is not re- 
alistic to have available measurements producing pure- 



Thus if these fluctuations exist, and we assume they do, their de- 
scription seems to be outside of today's theories. It might be of 
some interest to see whether more detailed definitions of fluctua- 
tions of work can be given in theories of subquantum mechanics, 
e.g., Bohmian or Nelsonian mechanics. 
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state subensembles, since the directly availabale measure- 
ments are only those of macroscopic quantities which are 
typically degenerate. In this case we may need to ap- 
ply a coarse-grained definition of fluctuations of work, 
where the initial mixed ensemble is separated into mixed 
subensembles described by density matrices <7 7 (ct 7 ^ <r 7 ) 



= ^ 7 (T 7 , > 0, ZAy = 1. 



(42) 



The definition then proceeds as above with obvious 
changes (e.g., \ijj a ){ip a \ ^7 m EHl )• 

This is a coarse-grained definition, since the realiza- 
tions of work tr (ft ct 7 ) can be reduced to more fundamen- 
tal ones, i.e., each of them can be presented as a convex 
sum of tr (lipajfyal ^ )■ As a consequence fluctuations of 
work — as quantified, e.g., by dispersion of work defined 
and discussed in section lTvl — are maximal for pure-state 
decompositions (more details on this are found in section 

EH . 



E. Separation of a homogeneous ensemble into 
pure subensembles by filtering outcomes of a POVM 
measurement . 

1. Positive Operator Valued Measurements. 

It is now our purpose to discuss how precisely one 
separates with help of measurements an initial homoge- 
neous ensemble £{p) into pure (necessarily homogeneous) 
subensembles. 

The most general type of a quantum measurement cor- 
responds to Positive Operator Valued Measure (POVM) 
|46l |4^| defined via N operators G a — not necessarily 
orthogonal — living in the n-dimensional Hilbert space 
H and satisfying the completeness relation 



JV 



GaG c 



(43) 



The most standard measurements of an observable A 
living in the n-dimensional Hilbert space TL and having 
non-degenerate spectrum {a Q }a = i are included in (|43|l . 
since now N = n and {G Q }™ =1 = {|a Q )(a a |}™ =1 , where 
the latter is the set of orthonormal eigenvectors of A. If 
the spectrum of A happens to have degeneracies, so that 
each eigenvalue a a has multiplicity n a , then G a is the 
n Q -dimensional projector on the subspace formed by n a 
linearly independent eigenvectors of A which correspond 
to the eigenvalue a a . Here N < n is equal to the number 
of distinct eigenvalues of A. 

If the measurement described by l|43|l is done on the 
ensemble described by a density matrix p, then the result 
a is found with probability 



Xn 



tr(G^G a p)=tr(G a pG^), 



(44) 



where X a > and J2 a =i ^ a ~ 1> ^ ue to After se- 

lecting results of the measurements referring to the out- 
come a one has the (sub)ensemble of systems described 
by a density matrix 



G a pGl t 
tr(G a G a p) ' 



(45) 



This subensemble occura with probability A Q as given by 
()44p. simply because this is the probability of the out- 
come a. The overall post-measurement inhomogeneous 
ensemble thus consists of N subensembles each of which 
has a density matrix (|45|l and probability (|44|l . The den- 
sity matrix of the overall post-measurement ensemble is 



JV 



P a - 



(46) 



POVM's are related to more usual projective mea- 
surements, where G a are mutually orthogonal, G a Gp = 
Ga5 a p, projections into eigenspaces of some hermitian 
operator: according to Neumark's theorem [46T.I47I ] every 
POVM can be realized as some (non-unique) projective 
measurement in a larger Hilbert space, that is involving 
additional degrees of freedom. A detailed discussion of 
this theorem and various versions is given in 0, |47| . 
In Appendix \D\ we shall discuss an example of it that is 
relevant for our purposes. 



2. 



Applying a POVM measurement, one now wishes to 
separate the mixed quantum ensemble described by the 
density matrix p into pure subensembles. The den- 
sity matrix p' of the overall post-measurement ensembles 
should then coincide with p given in l|19|) or l|21|) . while 
p' a appearing in (|45|l should be pure: 



Pa = |V>a)(V>o 



(47) 



Then the density matrix (|21|l is decomposed as in (|34ll . 

Let us first see which {A Q }^ =1 and are 
allowed to enter in l|34f> . and then we shall discuss which 
specific measurements should be done to achieve the ac- 
tual separation. 

It will prove useful to write l|34fl in an equivalent way 



JV 



"J2\ip a ){tp a \, 1% 



X a \lp a ) 



(48) 



a=l 



since it will allow us to focus on \ip a ), keeping in mind 
that the probabilities X a can always be recovered via 

A Q = (4> a \lpa)- 

According to the ensemble classification theorem |5ll 
El El El, if one has 



\lpa) = E Mak VPk\Pk), 



(49) 



k=l 
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where {Pk}k=i> i\Pk)}k=i are tne eigenvalues and eigen- 
functions of the density matrix p, and where M ak are 
complex numbers satisfying 



iV 



53 M ak M* aj 



k,j = 1, 



(50) 



Q = l 



then Eq. (|4"8l becomes p = Ylk=iPk\Pk){Pk\i as it should 
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The converse appears to be true as well: any decompo- 
sition (|48|l admits a representation (|49|l with some com- 
plex numbers M ak satisfying l|5U|) . 

Note that Eq. (|49|l implies the following formula for 
the probabilities X a : 



K = (i/) a \il> a ) = ^2\M ak \ 



Pk- 



(51) 



k=l 



As seen from (|50f) . the very possibility of writing 
Eq. igSJl implies 



N > n, 



(52) 



since M a & can be viewed as n different iV-component 
orthonormal vectors. The rectangular matrix 

{{M a k}a=i}k=i can be completed to a unitary N x N 
matrix by adding suitable elements. 

It is now straightforward to see which POVM can be 
taken to achieve the decomposition Q48[l. Take, for ex- 
ample, 



-1/2 



(ipa | Ipa) 



^\^ a )(^ a \r 1/2 , (53) 



where \ip a ) is defined in Ij48(l . Note that the converse 
appears to be true as well. For given POVM 143|) with 



where \n a ) satisfying 



N 



53 k«)<7r a | 



(54) 



(55) 



Note that any vector \4>a) having (ipafya) < 1 and living in the 
Hilbert space formed by the eigenvectors of p corresponding to 
its non-zero eigenvalues, can appear in at least one separation 
1351 of p. This follows from 

To prove this part of the statement, recall Footnote |8j expand 
\ip a ) over the eigenbase \p k ) ofp: \ip a ) = Sfc=i(P*IV'a> \Pk), sub- 
stitute this into 1481 . and then deduce 1501 using the orthonor- 
mality and completness of the above base in the Hilbert space H: 
Ea=i(PfclV'a> (ipalPi) = {Pk\p\pi) = S k iPk- Thus, any decompo- 
sition I48II34I can be constructed via 1491 and M ak (p^ \i/j a ) / y/Pk 
satisfying 1491 . If some eigenvalues of p are equal to zero, than 
the above construction should be restricted to eigenvectors of p 
corresponding to its non-zero eigenvalues. 



have to be neither orthogonal, nor normalized 15 , one can 
construct a representation 1481 134|) of p as 



N 
a=l 



1/2 



7T a )(7r a | p 



1/2 



(56) 



Thus, we have seen how all possible decompositions 
of a mixed ensemble into pure subensenbles can be con- 
structed via suitable measurements. 

We stress that the decompositions into a specific set 
of subensembles is related to a physical measurement, 
rather than to a mathematical choice. 



3. Preparation versus measurements. 

To avoid possible confusions we recall once again 
that the above separation procedure corresponds 
to preparation of the inhomogeneous ensemble 

{ A Q , £(|V>a)(^a|)}a=l With p = J^a A <* | Ipa) (ipa \ , 

starting from the initial homogeneous ensemble £(p). 
Though this preparation was based on a suitable 
measurement process, we were not interested by some 
aspects usually associated with it. For example, we did 
not keep track of the pointer variable of the measuring 
apparatus, which obviously should be the main goal of 
any measurement process studied for its own purposes 
kll. We were more interested by the influence of the 
measurement process on the final state of the system 
S, which is the basic characteristic feature of the 
preparation process in quantum mechanics |47^. 



F. Discussion. 

There are several questions on the physical meaning of 
the proposed definition of fluctuations of work that we 
decided to discuss separately. 

Question 1. Among all decompositions l|34|) of the 
Gibbsian density matrix p, there is a unique one (up to 
accidental degeneracies of the spectrum) given by the 
eigenvectors of p and realized via measurement of the 
Hamiltonian H . Then the energy has a definite value on 
each subensemble. Should not one therefore restrict the 
definition of fluctuations of work to this separation only? 

Answer 1. There are at least two reasons why the an- 
swer is no. First, even if the energy has a definite value 
initially, it will in general not have any definite value 



15 If one assumes in Eq. that |7r a ) are normalized, 

(ir a \iT a ) = 1, then this leads to orthogonality: (ng \"T a ) = 



Indeed, denoting IT 



Ka>(To|, one gets 



"a/3- 

n/3)n^ = 0. Since ^n a fl^^ ^11^11,3^ is non-negative by con- 
struction, one concludes n Q ri^ = for a ^ 0. 
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at the final moment, since an eigenvector of the initial 
Hamiltonian may evolve into a superposition of eigen- 
vectors. Thus, there are no special reasons to insist on 
the feature of separating with respect to energy. Second, 
more general separations are anyhow necessary to define 
fluctuations of work for an arbitrary ensemble, which can- 
not be decomposed into subensembles with each of them 
having a definite value of energy. 

Question 2. Is the orthogonal separation not special by 
the fact that various ensembles are described by orthogo- 
nal pure density matrices, and can thus be discriminated 
unambigiously? 

Answer 2. By definition any POVM is connected with 
an unambiguous descrimination of its different outcomes. 
This can be additionally clarified by looking at the ex- 
ample of the projective realization of a POVM presented 
in Appendix iDl where various subensembles constructed 
after the measurement are seen to be described by or- 
thogonal wave-functions in the composite Hilbert space 
7i ® W. The above question mixes the present situa- 
tion with a different one, where one is given a single 
system coming from one of two ensembles having non- 
orthogonal density matrices, and is requested to deter- 
mine by means of a measurement from which ensemble 
it is coming. Then, indeed, no measurements can ensure 
unambigious discrimination |4ol |. 

Question 3. The authors prescribe to the viewpoint 
that even pure density matrices (wave functions) describe 
an ensemble of quantum systems and not a single system, 
as some people like to think. How the proposed defini- 
tion will change, if one would wish to insist on the latter 
interpretation of quantum mechanics? 

Answer 3. The necessity of prescribing even the pure 
density matrices to ensembles of quantum systems was 
stressed in 0, 0, 0, 0. In particular, it is needed 
for the consistent solution of the quantum measurement 
problem [ill |47| . It is also known that with respect 
to certain aspects of quantum theory the prescription 
of pure density matrices to a single system is relatively 
harmless. We do not have space to discuss in detail what 
are those aspects and what precisely is meant by "rela- 
tively harmless" . At any rate, we do not advise to make 
the latter prescription, and the readers who wish to do 
that have to proceed on their own risk. We may mention 
that the definition of fluctuations of work remains then 
basically unchanged, but becomes conceptually closer to 
its classical analog, since now in defining fluctuations of 
work one assumes to operate with single systems both in 
quantum and classical situations. 



IV. DISPERSION OF WORK. 

The most direct quantity that characterizes how the 
realizations (|39|l of the random quantity work are spread 
around their mean W = ^2 a= i^aW a , is the (inter- 



subensemble) dispersion 



Sw 2 = J]A Q [(</; Q |n|V>a)-tr(fip)J ' (57) 

a=l 

N N 

= ^TAaK-WO 2 = Y, x <*wl-W 2 . (58) 



In contrast to W , this quantity depends explicitly on the 
subensembles used to define w a in i|39[) . So it depends ex- 
plicitly on the physical process that separated the initial 
ensemble into subensembles. 

It is useful to to determine the maximal <5u^ ax and 
the minimal Sw 2 lin values of Sw 2 over all possible decom- 
positions {\ip a )(ip a \, X a }a=i corresponding to the fixed 
p = Yl a =i ^a\ipa)(*f>a\- According to (fSHf) these extrem- 
izations can be equally well carried out over all possi- 
ble decompositions of unity in our n-dimensional Hilbert 
space, 



N 

E 



n, 



n, 



|7Ta)(7r 



(59) 



where {\^ a )}a=i have in general to be neither normalized 
nor orthogonal. 

Note that dispersions similar to l|57l) . with £1 corre- 
sponding to some other relevant observable, where intro- 
duced and studied in quantum optics, where separation 
of an ensemble by means of (continuous) measurements 
are well-known and were studied both experimentally and 
theoretically; see for a review. The results we present 
below on the minimal and maximal values of the disper- 
sion Sw 2 do not depend on the details of £1 and can thus 
be useful in general. 



A. Maximal dispersion of work. 

The maximization of 5w 2 over all possible separations 
i|5Hl5l?)l for given p and (l is carried out in Appendix IU1 
The result is 



i,k=l 



^PiPk 

Pi +Pk 







2 dstr 




Jo 





|(e fe |Jl| £l )| 2 -^ 2 (60) 



W 2 . (61) 



This maximum is reached on {|7r Q )}™ =1 being the eigen- 
vectors of a hermitian operator 



i,k=l 



Pi + Pk 



(62) 



where pk and \pk) are the eigenvalues and eigenvectors of 
p, as defined by lj*H3|) . 

Only when p and Q commute, [p, &] = 0, the maxi- 
mal dispersion (|60|l reduces to the more usual expression 
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tr[/5f2 2 ] — [tr(/3f2)] 2 . This and related questions are dis- 
cussed in more detail around Eqs. (|101lllU2l) . 

The maximal dispersion 1(601 161(1 provides an upper 
bound for the dispersion of work defined in a coarse- 
grained way; see the discussion around Eq. 1(42(1 . Indeed 
according to that discussion the coarse-grained disper- 
sion of work defined with respect to separation of £ (p) 
to mixed-state subensembles reads 



<5< = ^ 7 (tr(a 7 0) - T40 2 

7 



(63) 



Note a decomposition of <r 7 into some set of pure-state 
subensembles, <r 7 — J2 a A^IV'o^KV'a 7 ^, where fia^ are 
the corresponding probabilities with J2 a ^^ = One 
now finds that the dispersion 5w 2 defined as in Eqs. 1)571 
155) 1 . that is, via the separation of the ensemble £{p) into 

pure-state subensembles p = J2 a 7 ^Mq 7 ' l^Pa 1 ^) (V'i 7 ' |> is 
always not smaller than Sm 2 '. 



Sw 2 ~ Sw 2 cg > ]T ^ ( <^ 7) " W ) 2 - 5w, 

,7 

E^i 7) (<v£ 7) |ftw& ) ) 



a, 7 



a, 7 



E^Wf'lV)] >0- (64) 

/3 



i. 27ie behavior of the maximal dispersion (5iuJ, ax for high 



B. Minimal dispersion of work. 

Here we show that there are decompositions into 
subensembles such that for any a = 1, N: 

N 

Wa = {ip a \n\ip a ) = J2 x p w p = w > ( 6? ) 

0=1 

that is, the work does not fluctuate at all. In particular, 
this means that the dispersion 5w 2 attains its minimal 
value equal to zero. This fact is contrasting to the classi- 
cal situation, where according to points b and c in section 
III El w(x,p) should be negative at least for some values 
of (x,p), and the dispersion of work is large at least for 
sufficiently high temperatures. 

Recall that due to the parametrization l|54l 1561 159) 1 . 
Eq. 16 7|) can be written as 



tr(pfi), 



(68) 



and low temperatures. 



(■K a \p\lT a ) 

where {\^ a )}a=i with N > n have to satisfy 1(591) . This 
is equivalent to 

= (n a \Y\ir a ), (69) 

Y = p 1 / 2 n Q p 1 / 2 -tr [tip] p, (70) 

where Y is hermitian and traceless: 

trF = 0. (71) 

We now intend to show that in the Hilbert space 7i 
there are orthonormal bases {|7Ti)}" =1 which for the given 
Y do satisfy to 1)57 ) 1 5$ . 



With p given by the Gibbs distribution (EH d E2I , 
one gets from Eq. l)50")) 

<5< ax ^0, for T^O, (65) 

where T is the temperature of the Gibbsian ensemble. 
This is a natural result, as for a finite system S and T — > 
one gets p — > |eo)(eo|, where according to PU1H9^I . \eq) is 
the common eigenvector of p and H corresponding to the 
lowest energy (assuming that the latter is not degener- 
ate) . As no separation of a pure state into subensembles 
is possible, the work can take only one value. It is ob- 
vious that this is a general feature: the work does not 
fluctuate if the initial ensemble is pure. In the same way 
as in classics, fluctuations of work are present for mixed 
ensembles only. In this respect the dispersion of work 
is similar to the von Neumann entropy SVn = — tip hip, 
which is also equal to zero for pure density matrices p. 

For very high temperatures, where p ~ i/n, one gets 
from 1(50)1 

<5<ax = I tr (n 2 ) . (66) 

It is seen that for high temperatures the maximal disper- 
sion may be 0(1), provided that the (positive) eigenval- 
ues of ti 2 are finite and do not scale with n. 



1. Some concepts from majorization theory. 

To this end, let us recall some concepts from the math- 
ematical theory of majorization [5^ |53, OS, |53 . ^ or tw0 
real vectors x — (x% > ... > x n ) and y = (yi> ... > y n ), 
with their components arranged in non-increasing way, y 
is said to majorize x, 

x ■< y, (72) 

if the following conditions are satisfied 
k k 

E^'-E^' fc = l,..,n-l, (73) 

i=l i=l 
n n 

i=l i=l 

Due to Horn's theorem [H |H |H i| , Eq ® im P lies 
the existence of a n x n unitary matrix Qij such that 

n 

%i=^2yj\Qij\ 2 - (75) 
fc=i 

The proof of this statement is recalled in Appendix [0 
This proof is constructive, since it allows to gets Qij 
starting from given x and y. 
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2. The minimal dispersion of work is zero. 

Now denote by (yi > ... > y n ) the eigenvalues of the 
hermitian matrix Y arranged in non-increasing way. De- 
note by {|j/i)}f =1 the corresponding eigenvectors. As fol- 
lows from ' (HI E3 El 



(yi,...,y„) y (0,...,0). 



(76) 



According to (|75|l there exists a unitary operator Q in 
the Hilbert space H such that 

n 

= $>j l(%IQI^>| 2 = (w| Q ] YQ\ Vl ). (77) 
j'=i 



By denoting 



Q\Vi)=\n), i = l,...,n, 



(78) 



we see that l|77[l and the desired statement (I69[) are equiv- 
alent. 



C. Dispersion of work averaged over all separations 
of the ensemble. 

We have obtained the maximal and the minimal values 
of the dispersion of work Sw 2 . It is useful to have a 
third characteristic value of Sw 2 , the dispersion of work 
for a randomly chosen separation of the initial ensemble 
described by p into pure subensembles. Such a quantity 
will not depend explicitly on the measurement used for 
separation, and thus will help to understand how typical 
are the maximal and the minimal values of Sw 2 . 

Note from Eqs. (|49II50|I that for a given separation of p, 
that is, for a given representation l|34f> . the pure density 
matrices |V ; a)('0a| are expressed via elements M a i of a 
N x N unitary matrix M (see the remark after l]52[l'). We 
shall define the average dispersion Sw 2 v by assuming that 
M is random, and then integrating Sw 2 {M a i} over all 
possible unitary N x N matrices. Since there are no rea- 
sons for introducing a priori biases, we shall assume for 
the above integration the most uniform, unitary-invariant 
measure (Haar's measure): 



/ nfa=i d^M ai d$M ai Q{M ai } Sw 2 {M ai } 



(79) 



where ®{M a i} comes due to the unitarity constraint 



N 



9{M m } = 11 S 



N 



N 



IP 



. a<0 Li=l 



A' 



J2 M ™ M > 



Pi 



The rows (or, equivalcntly, the columns) of the matrix M 
are thus assumed to be a set of N orthonormalized, uni- 
formly random vectors. The quantity Sw 2 v is calculated 



in Appendix [H] 



Swi 



ds 



n 

n l — 



^ fpj {e t \Cl\e 



sp, 



Pi {ej\d\e 



i=i 



sp, 



-W 2 . (80) 



Note that Sw 2 v depends neither on N, nor on the off- 
diagonal elements (£j|f2|ej) of CI. 

For p having the Gibbsian form PTlH9l[^)l . 6w^ v has 
the following features for low and high temperatures T. 
It goes to zero for T — > for the same reasons as Sw^ ax 
does. In contrast, for very high temperatures, where p ~ 
1/n, one has from (|80|l 



5w„, = 



n(n + 1) 



(81) 



Under the same natural condition that we adopted for 
studying the high-temperature behavior of Sw^^, that 
is, (ej|f2|£j) are finite and do not scale with n, we see 
that Sw 2 v oc 1/n for n» 1, which is a typical behavior 
for dispersions of fluctuating macroscopic quantities in 
statistical physics . Note the difference with the high- 
temperature behavior of the maximal dispersion given by 
Eq. TO- 



D. The maximal and the average dispersion of 
work illustrated for a two-level system. 



Let us give concrete expressions of o~w 2 ^ &yi and Sw 2 v for 
a two-level system S. The initial Gibbsian density matrix 
is now a 2 x 2 diagonal matrix with eigenvalues p\ and 
P2 < Pi as given by l|19H ■ The most general matrix form 
of the traceless and hermitian operator Cl in this two- 
dimensional situation is 



v X 



X ~u 



(82) 



Eqs. (|6(JI I8U|) produce then the following expressions for 
tal.„ and Sw 2 v , respectively: 



Sw 2 



where 



Sw 



LU 2 (l-X 2 ) 



2 

max 



LU 2 (l-X 2 ) 1 + 



1 - x 2 
2x 



In - 



w 

u 2 

1 -x 
l+x 



x = pi — pi > 0, 1 > x > 0, 



(83) 



(84) 



(85) 



is a monotonically decreasing function of temperature, 
as follows from Eq. (J25JI. As seen from |JS3| IMjl . both 
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^max an d ^ w tv are monotonically increasing functions of 
temperature T. It is obvious that (Ju^ax > 8w^ v , except 
for the zero temperature situation x = 1, where they are 
both equal to zero. For very high temperatures, that is, 
for x — ► 0, <5u> av = 1/3 in agreement with l|81|l . Note that 
off-diagonal elements of Cl increase (5tu, 2 nax , while <5u> av 
does not depend on them at all. 



V. THERE IS NO DIRECT ANALOG OF THE 
CLASSICAL BK EQUALITY IN THE QUANTUM 
SITUATION. 



The discussion in section TlV Bl provides a definite evi- 
dence to think that in contrast to the classical case, the 
fluctuations of work in the quantum situation are not 
controlled by any direct analog of the classic BK equality 
(|llfl . In the present section we give another illustration 
of this fact. 

Assume for concreteness that the Gibbsian density ma- 
trix p in i|21|) was separated into pure subensembles by 
means of the measurement of H , that is, the subensem- 
bles are described by pure density matrices {|£z)( e d}iLi> 
where {|e;)}p =1 are eigenvectors of p. 

According to l|39|l one has for realizations of the ran- 
dom quantity work 



IV, 



{ei\UlHU T \ Sl ) - e u 

n 

^ CkiSk - I = 1, n, 



(86) 
(87) 



fc=i 

where 

C kl = Kefel^leJI 2 , 
is a double-stochastic matrix: 



Cfci — ^ Cm — 1. 



(89) 



fe=i 



;=i 



Each of realizations wi has probability pi, as given by 
One now constructs 



-/3w 



Pie 



-j3wi 



1=1 



1 ™ 
z ^ 



(90) 



that is, averages e ^ w directly as was done in the classical 
situation. As shown in Appendix iBl 



2Z 



< (e 



-Bw\ 



< 1 



2Z 



A EE £ T (1-CC T )£ 

n 

= J2[^\H\sk) 2 ~(e k \UtHU T \e k ) 
fe=i 



(91) 



(92) 



where e T = (£i,...,£„) is the vector of eigenvalues of 
H, Z is the partition sum defined in (|21|l . and where 
£ m i n and £ max are the minimal and maximal ones among 
(ei, ...,£„). 

Since all the eigenvalues v of the product of a double- 
stochastic matrix to its transpose satisfy < v < 1 16 , 
one has 



e T (l - CC T )e > 0. 



(93) 



Thus {e~P w ) is strictly smaller than unity. As com- 
pared to our discussion of the classical situation in section 
III El the result {e~^ w ) < 1 does not in general permit to 
draw quantum analogs of the classical features b (active 
realizations) and c (dispersion at high T) in section Hi El 



VI. COMPARISON WITH OTHER 
APPROACHES. 

In the present section we study two approaches known 
in literature. The purpose is to understand whether they 
have the proper physical meaning for describing fluc- 
tuations of work. Since they both allow to generalize 
the classical BK equality (though in different ways), the 
adoption of either of them will mean — as we discuss in 
detail below — that there is no major qualitative differ- 
ence in behavior of quantum and classical fluctuations of 
work. It should perhaps be stressed that our concern is 
the applicability of these approaches for describing fluc- 
tuations of work under conditions formulated in the In- 
troduction; their usefulness for other purposes is neither 
discussed, nor criticized. 



A. Observable of work. 

Recall from definitions i|29l I30|l that for any initial en- 
semble described by p, the average of Cl is equal to the 
work done on the corresponding ensemble. 

The approach goes on by stating 0, 0, 153, that 
the operator O is the "observable of work" in the stan- 
dard sense of quantum observables 17 , e.g., the quantity 
tr[pf2 2 ] — [tr(pCt)} 2 is to be interpreted as the dispersion 
of work for any p. However, while tr [Cl p] happens to be 



16 For any double-stochastic matrix C^, consider the matrix CC T , 
where C T is the transpose of C, and let be an eigenvector of 
CC T corresponding to a (necessarily non-negative) eigenvalue 
v - T.k 1=1 dkCikdi = vai. One has \ Y,k=l C ik a k\ = \"<h\ = 
"[Oil < Ek,i=iC ik C lk \ai\, and then i/E"=i l»fcl < Efc=l l°*l> 
that is, v < 1. 

17 Once Q is given an independent meaning as a quantum observ- 
able, there arises a question on its measurability, since the stan- 
dard theories of quantum measurements, see e.g. 46, 47], op- 
erate in Schrodinger representation. We shall not pursue this 
problem here, but rather take as working hypothesis that this 
measurement can be carried out. 
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equal to the average energy lost by the work source W, 
simply due to conservation of the average energy dur- 
ing the system-work-source interaction, this alone is, of 
course, not sufficient to regard f2 as an operator of work. 
In fact, such an interpretation relies on the analogy be- 
tween the definition (|3t)(l of Cl and the classical expres- 
sion © for energy difference. Such analogies are very 
widespread in general, and once it is accepted that f2 rep- 
resents the proper energy difference operator, the exten- 
sion of its interpretation toward operator of work seems 
rather natural. 

Let us however recall from our discussion in the In- 
troduction that we expect for a proper approach to fluc- 
tuations of work to apply in arbitrary non-equilibrium 
situation. It is now possible to argue that in general Cl 
does not have the proper meaning of energy difference 
operator, let alone its meaning as the operator of work. 

Let the ensemble £ (p) have a density matrix p(0) = 
|0) (0| , such that |0) is an eigenstate of Cl = (]\ H U T - H 
with eigenvalue zero: 



Clb 



0. 



(94) 



Recall that U\ H(t) U r is the Hamiltonian in the Heisen- 
berg representation a time r, while the Schrodinger pic- 
ture relation H(t) = H is due to the assumed cyclic 
feature of the process. 
In general, 



(95) 



so that |0) is neither an eigenstate of Uij.HU T , nor an 
eigenstate of H. 

According to quantum mechanics, Esq. (|94fl should be 
interpreted as follows: the operator Cl has on the ensem- 
ble £ (|0)(0|) a definite value equal to zero, that is, if it is 
interpreted as the operator of energy change, then for all 
single systems from £ (|0)(0|) the energy does not change 
during this thermally isolated process. 

There are however concrete examples — see Appendix 
U — showing that l|94|) can be consistent with 



UlHU T 0) ^ (0|i? m |0), for to > 2. (96) 



This shows that the energy does change, since some of 
its moments do. In other words, the interpretation of 
Hi as the energy difference operator is in general unsup- 
portable. Note that the non-commutativity feature as 
expressed by (|95|1 is essential for this conclusion. 



1. Restricted interpretation of Hi. 

A more restricted interpretation of Cl can be given in 
the light of the definition of fluctuations of work discussed 
in section lTTll This will also show that if p commutes with 
Cl (a semiclassical assumption), our approach is consis- 
tent with that of the observable of work. 



Let the eigenresolution of Cl be 

n 

Cl = };Uk\Uk){Uk\- 



(97) 



fc=i 



Note that for Cl to have the meaning of the operator 
of work it is necessary that i) its eigenvalues 
have the meaning of work by themselves, i.e., Uk should 
have both the meaning of average energy lost by the work 
source W and the average energy gained by a quantum 
ensemble, as we discussed in section fill Dl ii) probabili- 
ties of these realizations of work done on the initial en- 
semble £(p) should be given as { (uJk\p\oJk} }t=i- 

Now, if p and f2 commute, 



[p,Ci\ =0, 



(98) 



then their eigenvectors can be chosen the same, and, by 
measuring il, p = J2k=i Pk l w fe) ( w fe I can be separated into 
subensembles {£ (\uJk}(uk\)}k=i with probabilities pk — 
(u)k\p\u>k) ■ The average work done on each subensemble 
£(\ujk)(wk\) is then equal to u>k = (Wjfc|f2|u;fc), and one can 
admit the restricted interpretation of as an operator of 
work. 

Conversely, if p can be separated into subensembles, 



= X! X k\^k)(ipk\, 



(99) 



k=l 



and if each of them is let to interact with the work source 
W such that 



uj k = (ipk\Cl\^k), Afc = (u> k \p\u>k), 



(100) 



then three conditions (|99l 11001 197fl imply commutation 

To show this we proceed in a slightly indirect way, 
which is useful by itself. It can be noted that the disper- 



tv[pCl 2 } - MpCl)} 2 = ^(c^IpK) (cok - Wf (101) 
fc=l 

of the operator Cl provides an upper bound for the max- 
imal dispersion S u^ ax of work given by Eq. (|60|l : 



tr(p^ 2 ) - W 2 - Sw 

= ly (Pt ~Pk) 2 
2 /K P^+Pk 



2 

max 



|(p fc |f%}| 2 > 0. (102) 



The equality in the RHS of (|102f> is realized only if p and 
O commute, that is, either (pk\£l\pi) is zero for i =/= k, 
or for some pair i ^ k one has (pk\Cl\pi) ^ 0, but the 
corresponding eigenvalues of p are degenerate: Pi = pk- 
Thus 5w; max can be equal to tr(pCl 2 ) — W 2 only if [p, Cl] = 
0. 
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Now note that if Eq. J^O iltTUl 157) 1 are assumed to be 
valid, they imply tr(pf2 2 ) — VF 2 — Sw^^ < simply due 
to the definition of the maximal dispersion. This is con- 
sistent with Eq. [tTU2l only for tr{p£l 2 )-W 2 -Sw^ = 0, 
which implies [p, fi] = as we saw above. We conclude 
that (E3 Cnni EH imply ||5j§, as was promised. 

Thus, when [p, fi] ^ 0, f2 does not qualify as the opera- 
tor of work even in the restricted sense. We also conclude 
that though the approach does predict an upper bound 
for Sw 2 , this bound is not reachable 18 . 



2. On a generalization of the classical BK equality. 

Though fl does not have the meaning of the opera- 
tor of work — except in the restricted sense and under 
condition (|98|) — there is an operator generalization of 
Eqs. Ijlll |12H which was proposed by Bochkov and Ku- 
zovlev in [lj, Ell : 



tle -pd-f3H 



tr ( cxp 



exp 



dse- sH {le sH 



dse- sH £le sH 




We recall its derivation in Appendix El 19 . A similar 
relation was derived in [27]. 

Let us work out some consequences of (|1U4|I . As com- 
pared to the classic case, the matters are complicated by 
the presence of anti-time-ordering and the integral in 
(|103lll(jj|) . If one would insist on not having them, then 
the equality (|103l I104fl can still be converted into an in- 
equality. By applying Thompson- Golden inequality [5{| 

20 , tr[e A e B ] > tre A+B , valid for any hermitian opera- 
tors A and B (the equality sign is realized here if and 
only if [A, B] =0), one gets 



tr [pe 



k=l 



1 

z 



(105) 



18 Note that the difference tr(pCl 2 ) - W 2 - SW 2 = 
Ea=l A « ( W>c«|fiV«} - <V a |n|Va> 2 ) > is by itself always 
non- negative for any separation of p into subensembles. 

19 For the equilibrium ensemble 1211 . the Thomson formulation of 
the second law can be derived from IlLKjllTUH upon the applica- 
tion of the Peierls-Bogoliubov inequality (recalled in Appendix 
[EJ: e ~' 3tr [' i "l < ^tre-i 3 "-! 36 = 1. From this it follows once 
again that W = tr [fi p] > 0. 

20 Thompson-Golden inequality is a particular consequence 



of the following submajorization relation A (e^+^j 
A [e^-l 2 e^-l 2 ^ , where A (^Aj is the eigenvalue vector of a her- 
mitian operator A; see l5Sl for more details. 



where \ujk) and cuk are eigenvectors and eigenvalues of f2 
as defined by l(57)). 

If now we could interpret f2 as the operator of work, 
that is, if the eigenvalues Uk of f2 would have the meaning 
of work by themselves, we would note that {u>k\p\^k) is 
the probability of observing the eigenvalue Wk upon the 
measurement of Cl on the state p, and then Eq. (|10B|) 
would allow us to study fluctuations of work exactly 
in the way we did in section III El for the classical sit- 
uation. We would then draw the same general conclu- 
sions, and the fact that H105(l is an inequality will only 
strengthen these conclusions as compared to the classical 
situation. However, as we saw above, it is impossible to 
identify Q with the operator work, and thus fluctuations 
of work cannot be studied on the base of (|105|L except 
for [p, Q]=0, where Eqs. (EH CHI EHSJ) reduce to the 
usual (essentially classical) BK equality. 



B. On the approach based on two-time 
measurements of energy. 

Yet another, different approach to fluctuations of work 
and extension of the classical BK equality was proposed 
in Refs. |28l l29l l30| . We shall present it in a more ex- 
tended form, since it is necessary for the understanding 
of its proper physical meaning. On the other hand, in 
order do not dwell into unnecessary technical details, we 
shall assume that the spectrum of the Hamiltonian H is 
non-degenerate (compare with (23) 



£i<e 2 < ••• < £„. 



(106) 



At the time t = one measures energy (corresponding 
to the operator H) for the ensemble described by the 
gibbsian density matrix (|2"T|l . The probability to get an 
eigenvalue Ei of H is seen from (|20|) to be 



P (l\Mo) = (ezlph) 
= Pi- 



(107) 
(108) 



Eq. I|107|) is the general quantum formula (Born's rule), 
while Eq. I|108|) follows from the Gibbsian form (J2U Ell 
1221 of p. The symbol A4n in (|107|l reminds that the prob- 
ability is conditional and refers to the measurement of H 
done at t = 0. The necessity of such explicit notations 
will be seen below. Formally it is always allowed, since 
any probability is conditional. 

According to Wigner's formula for multi-time proba- 
bilities in quantum mechanics |4^, the subsequent mea- 
surement of energy at the time r — represented by the 
same Hamiltonian H due to the cyclic feature of the con- 
sidered process — will then produce a result with the 
conditional probability 



p(k\l,Mr,M ) = \{e k \U T \e l ){< 



(109) 



There three conditionals for the probability in the LHS 
of (|109|l : A^o and M. r stand for the measurements done 
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at t = and t — r > 0, while the index I indicate on the 
result £; got during the first measurement. The meaning 
of II109JI is that the ensemble of systems which during 
the first measurement at t = produced the result Ei, 
is described for t > by |e;)(e;|. The members of this 
ensemble couple to the work source W, the state evolves 
to U T \ei)(ei \ Ul at the time t = r, and then is subjected 
to the second measurement. 

Thus the total probability for having the result e; at 
the moment t = and the result e k at t = r is given by 

p(k,l\M 0> M T ) = P (l/ Mo, M T ) P (k\l, M T , M o )(n0) 
= p(l\Mo)p(k\l,M T ,Mo). (HI) 

When passing from lllUfl to 11111) . we used the obvious 
relation p(l\Mo,M T ) — p{l\Mo) (no dependence on the 
future). 

It is to be noted that 

n 

p(k\Mo,M T ) = J2p(k,l\M ,M r ) 

1 = 1 

= J2pi(£k\Ur\ei)(s l \U T \e k ), (112) 
i 

that is, the probability to have the result s k at the second 
measurement is for a general initial density matrix p not 
equal to 

p(k\M T ) = (e k \U T pUl\e k ), (113) 

which is the probability to get the result A; in a different 
context, where no first measurement was done. Such an 
equality is valid, though, if p commutes with H, which is 
the case with the Gibbsian density matrix (|21|l . Let us 
first restrict our attention to this case. One notes from 
(|l(J9fl the double-stochastic feature of p(k\l, M T , Mo): 

n n 

Y,P(k\l,M T ,Mo) = ^p(fc|/,K,M ) = 1, (H4) 
fc=i i=i 

and calculates using 1(131 1221 ITUS! ITT3|l : 

(e -/3( efe - EJ )) 0)T 

n 

= p(l\Mo)p(k\l,M T ,Mo)e-^«-^ 
k,l=l 

1 " 

= zY, P(k\l,M T ,M )e-^ = 1. (115) 
fe,i=i 

This is the equality got in Refs. [2^, |2^, |3(J as a gen- 
eralization of the classic BK equality. 

Note that for the density matrix (|21fl the average 

n 

P(l\M )p(k\l ) M T ,M )(e k -£ l ) = W, (116) 

k ,i=i 

is equal to the work as defined by l|29|l . The statement of 
the second law, W > 0, can once again be deduced from 
(|115f) by employing convexity of the exponent. 



1. Critique of the approach. 

Would now we be able to associate the work with a 
random variable having realizations {e k — £i}£ ;=1 and 
the corresponding probabilities {p{k\l,M T ,Mo)} kl= i, ^ 
would be possible to study fluctuations of work on the 
base of Eq. i|115[) , and to draw essentially the same con- 
clusions as we did in section Hi El for the classical case. It 
is, however, not difficult to see that the same criticisms 
we brought in section fVl Al with respect to the "observ- 
able of work" applies here too. 

Keeping in mind our discussion after Eq. (|l(J9fl . note 
that if the ensemble initially described by |£z)(£;| couples 
to the work source W, its mechanical degree of freedom 
looses at the time t = r the energy 

tr(fi|e,)<e,|) = tr [hU t |e,)(ej| tfj - e x . (117) 

Since the final density matrix U T |ej) (e; | UJ. need not com- 
mute with H, the energy need not any definite value at 
that time, and Eq. (|117f) does in general not reduce to 
£fe — ei with any k. Such a reduction takes place when 

n 

tr (hU t \ei){ei\uf^ = £ C kl e k = e <t) , (118) 
k =i 

for l = l,...,n, (119) 

where C k i is defined via (|88|l . and where (7r(l), ...,7r(n) ) 
is some permutation of the sequence (1, n). Eq. I|118|) 
can now be re-written as 

n 

Y / C k i(e k -e l ) = 0, (120) 
fe=i 

where the matrix C = C II the product of C and the cor- 
responding permutation matrix II, and where we noted 
that once the matrices C and II are double-stochastic (see 
iJHSJ for definition), so is C. Note with help of Eq. ifTDBT) 
that for I = n all terms with k ^ n in l|12(J|) are neg- 
ative unless C k ^ n = 0, which via the double-stochastic 
feature of C implies: C n ^ k — and C nn — 1. Contin- 
uing along the same lines for I < n, one gets that 1|118|) 
can take place only when C reduces to unity matrix, or, 
equivalently, C reduces to a permutation matrix: 

U r \ei){ei\tl = (e w(l) \. (121) 

Thus, in general it is l|117|) and not e k — Si that can be 
interpreted as the work for this single realization occuring 
with probability pi , and this is precisely the point from 
which we departed in section ITTT1 

It is also straightforward to see that the approach does 
not apply out of equilibrium. The reasons for this are 
more straightforward than for the previous approach. 

Recall from the Introduction that the proper definition 
of fluctuations of work is expected to apply to any non- 
equilibrium initial ensemble £{&) with density matrix a 
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not commuting with H: 



C. Summary. 



[a,H]^0. 



(122) 



In particular, the work averaged over those fluctuations 
should be equal to the one done on the ensemble. 

The present approach is generalized uniquely for ar- 
bitrary initial state: the definitions of p(1\A4q) and 
p(k\l,M T ,-Mo) inEqs. H109lll()7|) remains unaltered: one 
substitutes there a instead of p. 

It is now straightforward to see from l|122|) 
that due to non-diagonal terms in <r, the average 
YJk.i=iP{ l \Mo)p{k\l, Mr, M ) (e k ~ Ei) is not equal to 
the work tr(<rf2) done on the overall ensemble: 

n 

tr(aO)- Y, p(l\M )p(k\l,M T ,Mo)(e k -s l ) 



k.l=l 



= tr 



[U}HU ( a - \ei) (e t \&\e t ) (e, | )] ^ 0. (123) 



2. The approaches based on the "observable of work" and 
on two-time measurements of energy are different. 

This is seen already by comparing Eq. (|1(J5|) with (|115fl . 
Still we want to understand this difference in more de- 
tail. More precisely, though for the initial density matrix 
commuting with H, the first and the second moments 
generated by the two approaches are equal: 

n 

tr[p£l p ] = Y p(l\M a )p(k\l,M T ,M )(e k - 6^,(124) 

k,l=l 

p = 0,l,2, (125) 

already the third moments are in general different, even 
for [p, H) = 0. Indeed, assuming validity of the latter 
condition, one gets 

n 

tr[pn 3 }- ]T p(l\Mo)p(k\l,M T ,M ) (e k - st) 3 

((l [Cl, p] H\ = tr (p [H, fi] Clj . (126) 



fe,i=i 



tr 



The RHS of (fT^f) vanishes if [p, Cl] = 0, or [H, Cl] = 0, 
in addition to [p, H] = 0. 

Note as an illustration that for the two-level example 
of section HVDl the RHS of JH| reads: 



tr 



(p [H, Cl] n) - (p! - P2 )(e 1 - e 2 )\x\ 2 , (127) 



where ft is given by (|82(l , and where p k and e k are eigen- 
values of p and H , respectively. For the Gibbsian density 
matrix p, the RHS of (|127|l has negative sign. 

Finally note that differences between the two ap- 
proaches were recently studied in |32| in a different con- 
text. 



We have discussed two approaches known in literature, 
and argued that in the proper quantum domain, though 
being related to energy, they do not describe fluctua- 
tions of work. Work is a rather particular form of energy 
having several specific features we discussed in section 
IIIBI These two approaches allow different generaliza- 
tions of the classical BK equality and this makes them 
operationally close to the classical situation. Still this 
resemblance is superficial, since, as we argued, it is not 
ensured by the two approaches that realizations of the 
claimed random quantity/operator of work have them- 
selves the physical meaning of work. 



VII. CONCLUSION. 

The second law has a statistical character as it is both 
formulated and valid for ensembles of identically pre- 
pared systems. It is therefore of interest to investigate 
this statistical aspect in more detail. For the entropic 
formulation of the second law, this analysis is by now a 
standard chapter of statistical thermodynamics 0, 0, y] • 

In the present paper we studied how Thomson's for- 
mulation of the second law: no work from an equilibrium 
ensemble by a cyclic process, emerges through averaging 
over fluctuations of work in the quantum situation. It 
will be useful at this moment to recall the special role of 
Thomson's formulation, and then to proceed with con- 
cluding remarks on our results. 



1. The main features of Thomson's formulation of the 
second law. 

a. The formulation uses the concept of work which 
is unambiguously defined both conceptually and opera- 
tionally, both in and out of equilibrium. In this respect 
work is contrasting to entropy, which is well-defined only 
in (nearly) equilibrium states of macroscopic systems. 

b. Thomson's formulation is valid for any finite 21 
or infinite |17| . quantum or classical system interacting 
with macroscopic sources of work. In this context one 
notes that not all formulations of the second law have 
such a universal regime of validity. While all formulations 



21 In this context one sometimes hears that the second law must 
refer to macroscopic systems, and there is no sense in applying 
it for finite systems. This opinion is not correct, as instanced by 
Thomson's formulation. Would it not be valid for a finite system 
coupled to work sources, the very its application to macroscopic 
systems will be endangered, because the initial Gibbsian ensem- 
ble 1211 is prepared under weak interaction with an equilibrium 
thermal bath, see e.g. 0: any cycle violating the formulation 
for a finite system can be repeated to achieve a violation for the 
bath. 
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are equivalent in the standard thermodynamical domain, 
that is, for (nearly) equilibrium states of macroscopic sys- 
tems, some of them have definite limits when considered 
for finite systems |20| or for low temperatures (quantum 
domain) Hi 03. 

c. In its literal form Thomson's formulation does 
not imply any irreversibility, since the dynamics of the 
system coupled to work source is unitary and thus for- 
mally reversible: if some work was put into the initially 
equilibrium system it can in principle be extracted back. 
The irreversibility with respect to work transfer comes 
into existence when one takes into account that in prac- 
tice no work source can interact with all possible degrees 
of freedom. In particular, if the system was subjected to a 
thermal bath after it had interacted with the work source, 
it relaxes back to its Gibbsian state and the work which 
had been put into it cannot be recovered by any work- 
source acting on the system only (a similar argument is 
presented in |(|, chapter 5). Thus the relation between 
Thomson's formulation and irreversibility is seen clearly 
at least in the simplest situation. 

d. It should perhaps be stressed that Thomson's 
formulation does not refer to all aspects usually associ- 
ated with the second law, e.g., by itself it does not explain 
how a subsystem of a proper macroscopic system (ther- 
mal bath) relaxes toward a Gibbsian equilibrium state 
(though on the basis of Thomson's formulation it is still 
possible to argue that - under certain assumptions - the 
Gibbsian state is the only one which forbids work extrac- 
tion via any cyclic thermally isolated process [iH Il8j). 
The property of relaxation toward a Gibbsian state is to 
be viewed as an independent item of statistical physics; 
its standard classical understanding was re shap ed in lit- 
erature various times; see, e.g., [ill Il2l ISfil l37l| . 



2. What appeared to be problematic in defining fluctuations 
of work in the quantum situation. 

As we saw in section I VII due to non-commutativity 
of various quantum observables, there are different quan- 
tum quantities which, in the classical limit, coincide with 
the random quantity work. Thus, as often, classical rea- 
soning alone is of no help for defining fluctuations of 
work. 

One therefore has first to state what basic features the 
fluctuations of work are going to have, as we did in section 
[I] and then to recognize that the work is an essentially 
mechanical, classical quantity — in spite of the fact that 
it can be added or extracted from a quantum system — 
since it is an energy transferred to macroscopic degrees 
of freedom of the work-source that, at least in principle, 
should be retransferrable to other classical sources. 

Once this feature was recognized, the definition of fluc- 
tuations of work we presented in section IIIII is straight- 
forward. 



3. What is similar and what is different in classical and 
quantum definitions of fluctuations of work? 

In both situations the definition of work as a random 
quantity employs the same idea: the initial homogeneous 
ensemble of identically prepared systems is separated 
into irreducible (homogeneous) subensembles. Both in 
quantum and classics these irreducible subensembles are 
described maximally completely. In classics they corre- 
spond to a trivial subensemble of identical copies of the 
same system (so that within a subensemble no fluctua- 
tions are present whatsoever), and they are described via 
phase-space points and trajectories. In quantum mechan- 
ics these subensembles, described by pure density matri- 
ces (wave functions), provide definite (non-fluctuating) 
values for the largest possible, but non-exhaustive, set of 
observables. 

In classics the irreducible subensembles of the initial 
ensemble obviously exist a priori, that is, without need 
of any measurement. In the quantum situation the very 
structure of subensembles does depend on the measure- 
ment applied for the actual separation, or, in other words 
for the preparation of an ihomogeneous ensemble. As 
the main consequence, the separation of a mixed ensem- 
ble is not unique, and thus the random quantity work is 
contextual in the quantum situation. It is therefore to 
be recalled that in the quantum situation the definition 
of fluctuations of work always needs this initial prepara- 
tional measurement, a step which is absent in classics. 

In the second step, systems from each irreducible 
subensemble interact with the same macroscopic source 
which docs on them the same thermally isolated process. 
Realizations of the random quantity work are then de- 
fined as the average energy increase of the work-source 
when interacting with each subensemble, while the prob- 
ability of each realization is given by the weight of the 
corresponding subensemble in the initial mixed ensemble. 

In this way the full physical meaning of work is kept, 
and the approach can be applied to any non-equilibrium 
initial state. 



4- Dispersion of work. 

The most direct quantity which characterizes fluctua- 
tions of work is the dispersion of work we studied in sec- 
tion llVI Although the work is a contextual random quan- 
tity and depends on the measurement that was done to 
separate the initial mixed ensemble into pure subensem- 
bles, one can define two reasonable quantities — maximal 
dispersion and the dispersion for a randomly chosen sep- 
aration on the initial ensemble — that depend solely on 
the internal features of the system, that is, on its initial 
state and its time-dependent Hamiltonian. These quan- 
tities were calculated explicitly for any finite quantum 
system and studied in section llVl 
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5. Non-existence of the direct generalization of classic BK 
equality. 

In the classical situation fluctuations of work in an ini- 
tially equilibrium state are controlled by the BK equality 
[la.l2l| . This equality allows to draw a number of model- 
independent statements on fluctuations of work which we 
summarized in section III El In contrast, the direct gen- 
eralization of BK equality to the quantum domain — the 
one which would allow to draw similar qualitative conclu- 
sion on fluctuations of work — does not exist; see section 
IVl As we discussed in detail in section I VII there are 
quantum generalizations of BK equality, but they refer 
to quantities which describe fluctuations of work only if 
some classical features are present, e.g. those implied by 
Eq. i|98|) . As the main consequence, fluctuations of work 



in the quantum situation can have features which are im- 
possible in classics, e.g., (inter-subensemble) fluctuations 
can be absent completely. 
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APPENDIX A: DERIVATION OF EQ. ||T5J>- 

Here we recall from [6(j a generalization of Cauchy 
inequality used in Eq. I|15|) . 

Denote by T — (x,p) the phase space point. Assume 
that al the integrals over the phase-space used below are 
finite. The desired inequality reads: if a(T), b(T), x(T) 
are some functions satisfying 

J dr a(T) x(T) = 0, J dr b{T) x(T) = 1, (Al) 



then 



J dTx 2 (T) 



> 



JdTa 2 (T) 



JdTa^T)JdTbHT)^[jdTa(T)b(T)] 



r-(A2) 



To prove this, define 

a = J dr a 2 (r), 
b = J dr& 2 (r), 
c = J dra(r)6(r), 



Ab(T)-Ca(T) 
AB-C 2 : 



(A3) 
(A4) 
(A5) 
(A6) 



and note that 



J drx 2 (r)> J diY(r), (A?) 



due to 



j drx(r) y (r) = J dv y 2 {T), 



(A8) 



which is valid by constructions l|A3IIA6(l . Eq. (|A2f> fol- 
lows from (| A7|) . To get from here Eq. (|15fl one identifies: 
x(T) = y/vF), b = y/V<rj e-^ r \ a = ^PjT){f(T) - 
(/))• 



APPENDIX B: DERIVATION OF EQ. (|9H) . 

Let f(x) is a smooth function, {xi}f =1 are n points, 
and 



^X k x k , X k >0, 2jAfe = 1. 



(Bl) 



k=l 



k=l 



Apply the incomplete Taylor expansion to f(xi) 



f( Xi ) = f(x) + f{x) (Xi -x)+ f -^- (Xi - xf ,(B2) 

where ^ lies between Xj and x. Denote by a: max and x m i n 
the maximal and the minimal numbers among Xj. This 
implies x max > ^ > x m i n . Now assume that f"(x) is 
monotonically decaying: 



f(w) > /"te) > r(x min ). 

Using P2IB3H one has 



(B3) 



n n 

E Afc /(^) " = oE f'(^)Xk(x k - x) 2 , (B4) 



fe=i 



fc=i 



/ (^min) 



E ^k(x k - x) 2 > E Afe/(xfc) - /(x) 



fe=i 



k=l 



> 



f (^max) 



(B5) 



k=l 



To derive Eq. J^U, start from JSUJl, apply to it Eq. 
with a convex function /(x) = e~^ x , (3 = 1/T > 0, 
and identify x, = e», = C k i. The desired Eq. (|91|l is 
recovered upon the summation over I. 



23 



APPENDIX C 

Let tr(A 2 p) = [tr(Ap)] 2 for some hermitian operator 
A and density matrix p. In the main text we called such 
operators dispersionless with respect to the ensemble de- 
scribed by the density matrix p. 

In Cauchy inequality |tr(i£J)| 2 < tr(ii t ) tr(BB^), 
which is valid for any operators A and B, the equality is 
realized for A = aW , where a is a number. Thus 



= tr(i 2 p)tr(p) 



implies 



A\/P = a 



(CI) 



(C2) 



Now apply the eigenresolution y/p = J2k=i \/Pfc l E fc)( £ fc| 
into (|C2(I and multiply it from the right by \p m ) to obtain: 



APPENDIX D: RELATION BETWEEN POVMS 
AND PROJECTIVE MEASUREMENTS. 

We outline how a POVM given by gSl SI E3> can be 

connected with the usual projective measurements. 

Let the system S interact with another auxiliary sys- 
tem Q. The initial states of S and Q are, respectively, p 
(living in a n-dimcnsional Hilbert space Ti) and 
living in a ./V-dimensional Hilbert space Ti.' . The initial 
state of the overall system S + Q is p (g> \gi){gi \- 

Select a fixed orthonormal base {|ufc)}jj =1 in Ti. Let as 
well \gi) be a member of an orthonormal base {|<?q)}^Li 
in Ti'. Assume that the above interaction is chosen such 
that the corresponding unitary evolution operator hi in 
the composite Hilbert space Ti®Ti' results in 



U \u k ) <g> \gi) 



N 



G a \u k ) ® \g a ), 



(Dl) 



/p^A\p m ) = ay/p^\p m ). 



(C3) 



It is seen that either only one among the eigenvalues pk 's 
is non-zero and then the corresponding eigenvector is also 
an eigenvector for A, or, more generally, that A acts as 
oc 1 in the Hilbert space formed by eigenvectors of p 
corresponding to its non-zero eigenvalues. In both cases 
the measurement of A on the state p always produces 
definite results. 

Thus any operator A that is dispersionless on the den- 
sity matrix p has to have the following block-diagonal 
matrix representation 



A = 



a lfexfe 





B 



(C4) 



where a is a real number, lfc X fc is fc x fc unity matrix in 
the fc-dimensional Hilbert space formed by eigenvectors 
corresponding to non-zero eigenvalues of p, and finally B 
is an arbitrary (n — k) x (n — k) hermitian matrix. It has 
(n — k) 2 free parameters, and another free parameter of 
A is coming with the real number a. Thus, A has 

(n-fc) 2 + l, 

free parameters. 

Note finally that various operators that are dispersion- 
less on a pure density matrix need not be mutually com- 
muting. As one of the simplest examples take 



IV>> = 




where e is real. As seen, C\i/j) 
but [C,D] £ 0. 



\ip) and D\ip) = e\ip), 



where G a defined by l|43|l are operators living in the 
Hilbert space Ti. 

One notes that due to l|43|l 



(gi \ <g> (ui\U\u k ) ® \gi) = Ski- 



(D2) 



This specification of U is not yet complete. To complete 
the definition of U in the composite Hilbert space Ti^Ti 1 
one should define its action on |itfc)(g> \g a ) for a = 2, N 
in addition to I|D1|) . This will suffice, since \v,k) ® \g a ) for 
k = 1, n and a — 1 , . . . , N is an orthonormal base in the 
composite Hilbert Ti®Ti' . This completion is possible to 
do and one can do that in many different ways, because 
it amounts to completing the orthonormal set of vectors 
G a \uk) ® \g a ) to an orthonormal base in H (g) Ti' . Then 
the columns (or equivalently the rows) of Li in the base 
Wk) <8> \g a ) w iH be a set of of Nn orthonormal vectors, 
which is equivalent for hi being a unitary matrix. For 
a given unitary matrix hi there is a hermitian operator 



H ov in Ti ® Ti' such that hi = exp 



with some 



time-parameter t. Thus, H ov can serve as a Hamiltonian 
realizing the needed interaction. 

Once (|Dlll is valid for the fixed base {|wfc)}Jj =1 , one 
gets for an arbitrary p in Ti: 



N 



Uptf = Y J G aP Gi®\g a ){g 



(D3) 



a = l 



To complete the construction, one now measures in Ti' 
any hermitian operator with a non-degenerate spectrum 
having the base {\g a )}a=i as its eigenbase. Then the 
POVM (021 SI E3 accounts for what is happening - 
after the interaction and after the selective measurement 
— with the initial ensemble described by p. 
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APPENDIX E: DERIVATION OF EQS. Jl03l H04l) ■ 

One notes from flfllEgfr 



UtpUr 



exp 



-fiUl H U T 



-PSI-PH 



(El) 



where we used the definition Ct — Ujj. H U T — H of Cl. 
Note the standard equality 



P 



ds e 



~ sH he sH 



^-PH 



, (E2) 



where exp means time-antiordered exponent. The easiest 
way to derive Eq. 1|E2(1 is to differentiate both sides of 
it over /3, and note that they both satisfy to the same 
first-order differential equation with the same boundary 
condition at /3 = 0. 
One now gets 



Ulf>Ur 



exp 



dse- sH ne sH 



(E3) 



Tracing out both sides, one finally obtains Q103I 1104*)) . 

The simplest way to derive Peierls-Bogoliubov inequal- 
ity from (|103l I104fl is to note the well-known extremal 
feature of the free energy: 

-Tlntve-^-^ 
= rain 

where the minimization is taken over all possible density 
matrices. This can alternatively be written as 



tr e 
= max 



PH-pn 



expj-^tr^^ + fi)] -tr(prnp)} . (E4) 



The desired inequality is got by putting the particular 
density matrix p — e~@ H 



/Z in the RHS of l|E4l 



APPENDIX F: PROOF OF HORN'S THEOREM. 

We intend to prove that given two vectors x T = (x\ > 
... > x n ) and y T = (yi > ... > y n ) with the following 
majorization relation 

x<y (Fl) 
there is a real orthogonal matrix O — (Oy ) such that 

x i=J2 ° x = dia § [ dia s[^] ° T ] • ( F2 ) 

3 

Here diag[y] means the n x n diagonal matrix formed by 
the vector y, diag [ O diag[y] O t ] is the vector formed by 



diagonal elements of the matrix Odiag[y]0 T , and O t 
means transposition: (0 T )ki = Oik- Note that for any 
orthogonal matrix Oij, the matrix O?- is always double- 



O 



2 



1, while the converse is 



stochastic: 
not true |56^ 

For a given orthogonal matrix there are many uni- 
tary matrices Qij such that Ofj = \Qij\ 2 ; e.g., Qij = 
e^Oij, where <pj are arbitrary phases. 

The proof is adopted from Ref. and will be realized 
in two steps. 



1. First step. 

First, one shows that I|F1|I implies 

x =T 1 T 2 ...T„_ij/, (F3) 

where matrices T are so-called T-transform defined as 
follows. Any T-transform T(m, Z; t) has three parame- 
ters: m < I and t, where m and I are natural numbers 
between zero and n, and where < t < 1. Its action on 
any vector y, y T = (y\, .... y n ), is defined as 



z = T(m,l;t)y, 



(F4) 



where 



z T = (yi, :,y m -i,ty m + (1 - t)y h y m+u 

(1 - t)y m + tyuyi+i, ...,y n ) (F5) 

To get the matrix of T(m, 1; t) starting from nxn unity 
matrix 1, one proceeds as follows: 

(l)mm = 1 — ► ( T(m, I; t) ) mm = t, 

(i)u = 1 -> {T{m,l-t)) u =t, 

(i) m/ = -» {T(m,l;t)) m i = l-t, 

(l)lm = -f ( T(m, Z; t) ) Iro = 1 - t, (F6) 

while all other elements of the unity matrix are left un- 
changed. 

Eq. (|F3I) can now be proven by induction. It is obvious 
for n — 2. Assume it holds for n — 1. As seen from (jFll 
1731 174|) . one has y„ < xi < yi, so there exists an index k 
that 

?/fc < xi < y x . (F7) 

This implies 

xi = ty x + (1 - t)y k , (F8) 
for some < t < 1. Define a T-transform T(l,k;t) via 



T(l,k;t)y, 



(F9) 
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where 

V T = (2/2, -,yk-x, (1 ~ t)y 1 +ty k ,y k+1 , ..,y n ). (F10) 

It is straighforward to show that 

f> (x 2 ,..,x n ). (Fll) 

Since we assumed I|F1|1 =>■ (IF3|) is valid for n — 1, there 
is a product of T-transforms such that (£2, .., x n ) T — 
T 2 ...T n -i y. 

Thus the implication 1F1|I (IF3|) is proven by induc- 
tion. 

2. Second step. 

Let us finally prove the implication (|F1|) =4> l|F2J) . Note 
that for any T-transform T(m,l;t) one can associate an 
orthogonal matrix V(m, I; t) by reshaping (|F6|) as follows: 



-> (F(m,Z;i) ) mm = Vi, 
(V(m> *?*))« = y/i, 



(l)mm — 

- 1 

(i)mj = -»■ ( V(m, I; t) ) ml = -vT^t, 
(i)/m = -» (V{m,l;t)) lm = VT~t. (F12) 

Then l|F4|l is equivalent to 

z = diag [ V(m, J; t) diag[y] V T {m, l;t)] . (F13) 



To prove the implication (|F1|) =4> (|F2|) one again pro- 
ceeds by induction. It is obviously valid for n = 2. One 
assumes its validity for n — 1 . This means (|F11|) can be 
re- written as: 



(x 2 , ■■■,x n ) T = diag V diag[y] V 



(F14) 



where V is some orthogonal matrix. To complete the 
proof, define an orthogonal matrix 



O = 



1 
V 



V, 



where the matrix V corresponds to the T-transform T 
defined in (|F9|1 ( via the correspondence described in 
(E23)) and rewrite (|FTTllFll as 



x = diag [ O diag[y] T ] . 
This proves the implication IjFlf) =>■ (|F2p . 



(F16) 



For the index k and for the parameter t mentioned before 
(|F7|) one has 



Analogously 

n 





k = 3, 



1 



2 2 



3 

t = -. 

5 



\ /I l\ / 



1 



(F18) 



U 1 i 

\ " 9 



V 



I $ 

5 u 5 



/ 



(F19) 



V-3/ 



realizes the relation between x and y via T-transforms. 
Finally the orthogonal matrix O in l|F16|) reads for the 
present example: 



/ 



O = 







V 5 



(F20) 



APPENDIX G: DERIVATION OF EQ. dSOl) . 



Here we find the maximum of 

N N 



= J2 x <* w « = E 



a=l 



t^l (V'alV'a) 



(Gl) 



where the maximization is taken over all possible decom- 
positions H34JI of the mixed state p into pure states. Using 
(1541 156( 1 one writes equivalently 



{W 



N 

E 

a=l 



tr pll a 



(F15) 

where 



n Q = |7r Q )(7r Q | 



(G2) 



(G3) 



The maximization in (|G2|I is taken over all decompo- 
sitions of unity 



N 



(G4) 



a=l 



3. An example. 

Let us realize explicitly the above construction for an 
example: 



x= (0,0,0), y =(2, 1,-3). 



(F17) 



where operators II Q live in the n-dimensional Hilbert 
space TL. 

The general idea of the following method was adopted 
from |62j |. Introduce an operator X via 

pV 2 ilpV 2 = Vt(pX) = \(pX + Xp) , (G5) 



20 



then 

tr(p 1 / 2 0p 1 / 2 n ct ) = 3ttr(il a pX). (G6) 

Recall Cauchy inequality 

|tr(is)| 2 < tr (11+ )tr(BB t ), (G7) 

which is valid for any operators A and £?, with the equal- 
ity being realized for 



A = aB\ 

where a is a number. 

Applying inminzj: 

r i 2 

tr(/5 1/2 ^/5 1/2 n Q ) < |tr(n Q pX)| 2 

^Itr^p^p^xnV 2 )! 2 

<tr(fl a p) tr(pXn a l), 
one gets for JHSIESJl: 

2 



(G8) 

(G9) 
(G10) 



JV 

E 



[tr (pVSQpi/s n Q ) 



AT 



tr pIT c 



< ^tr(pxn Q i:) (Gii) 



a=l 

'2\ - f^ol/26sl/2; 



= tv(pX 2 ) = tatfl'top/'X) (G12) 

Eq. (EH is realized as equality for 

3tr(n a pi") = 0, (G13) 

while the Cauchy inequality (|G11J| becomes equality for 

/ 3 1 / 2 l|7r Q )=a a/ 3 1 / 2 | 7 r Q ) ) (G14) 

where a a are some numbers. 

Both conditions (fCTTal iGMjl are realized simultane- 
ously by taking {|7r Q )}™ =1 and {a Q }™ =1 as, respectively, 
eigenvectors and eigenvalues of the hermitian operator 
X. The representation l|62(l for X follows from l|G5Jl . 
The desired equation (JHOJ) is seen from IjGllI IG12|) . 

APPENDIX H: DERIVATION OF EQ. dSSll . 

Here we calculate the average {(w 2 )} av of (w 2 ), given 
by (|G1|I . over the measure (jHOJl. Using Eq. (gHl it is 
straightforward to see that all the terms in the summa- 
tion in the RHS of I|G1|) produce the same average. Thus, 



J VMS 


Elx\Mi\ 2 -l_ 




J VMS 


Eli \Mt\* 


- 1 





N 

where we denoted 

VM = Y[ dKMi dSMi 



(HI) 



JV 



(H2) 



and where one notes from l|49() 
<j>{Mi} 



M j M^^/pJp^{e k \9.\e j 



(H3) 



Passing to polar coordinates 

,2tt N 



vm= Udipi / ni M *i d i M *i ( H4 ) 

■/ -/O j_i Jo „-_i 



one gets 

{(^ 2 >}a 



jV 



£ ^p fc ( £i |0| ei )( £fe |0| e ,)^, (H5) 

2 o 



where 



JV 



Ijk 



= / [Tdz, <s 

i=l 

/ ] I' h < 6 

Jo ._, 



iV 



i=l 
JV 



En 



i=l 



, (H6) 



(H7) 



These integrals are calculated for j,k — 1, .., n by the 
same method. For example, 



— r JV r 



r(jv + 



JV 



' JV 



^ 2e r 

En : 
i=iPiVi 



N 



1=1 

oo n 



En ) 
i=iPiVi 



e -"ET=i vi 



(H8) 
(H9) 



/ ds / TTdyiy? e -£"=i^ spi+1 
Jo t\ 



oo /■oo N 



where when passing from 1|H8(I from (|H9|) we changed the 
integration variable, zi — jji/r and integrated over r from 
to oo. 

Further calculations are straightforward and lead to 
(I80|l . For dealing with this equation the following formula 
is useful: 



• /u fe=i fc=i 

where #fc's are some positive numbers. 



(H10) 



APPENDIX I 

Here we give an example of the situation discussed 
around (|94l I95|) . The effect announced there does ex- 
ist neither for two-level systems — simply because for a 
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2x2 traceless matrix Cl a zero eigenvalue implies Cl = 0, — 
nor for three level systems. This last fact requires some 
calculations which will be omitted. 

The simplest situation which supports the effect is thus 
a four-level system. The following example was inspired 
by ■ Consider a four-level system with Hamiltonian: 



H = 



A 
B 



(II) 



has a doubly degenerate eigenvalue equal to zero, and the 
corresponding eigenvectors can be taken as 



|0i> 



|0 2 ) 



(16) 



where A and B are 2x2 matrices 



a b 
b c 



(12) 



with a, &, c and d being some real numbers. 

Assume that the unitary operator U T is given as an 
exchange interaction: 



1 

1 o 



(13) 



where 1 is the 2x2 unit matrix. 

The Hamiltonian IJ\ H U T in the Heisenberg represen- 
tation at time r then reads 



UlHUr 



B 
A 



(14) 



It is seen from the block-diagonal form of H and \j\ H U T 
that they both have the same eigenvalues. 
As follows from JTll dl EJl , the matrix Q, 



n = 



fo oo o \ 

c-d 

\ d-c/ 



It is now obvious that though 



0i U\HU T 0i) - (0i|iJ m |0i) = 0, for m = l,2, 



one still has 



0i 



uIhu t 



G l )-{Q x \H i \Q l )=b 2 {c-d)^Q 



0i 



0i)- (0i|-ff 4 |0!) 



Ul H U T 



2ab\c-d) + b\c z -d z )^Q 



(17) 



(15) These relations were used in (|96|l . 



